W0182

Smart CIF Dictionaries for Crystallographic Knowledge Bases. S.R. Hall, N. Spadaccini, D. du Boulay, I.R. Castleden, Departments of Crystallography and Computer Science & Software Engineering, The University of Western Australia, Perth, WA 6009, Australia.

The CIF dictionaries (core, pd, mm) [1] are used widely for validating data submitted to structural science journals and databases. These applications are, however, restricted by the inability to specify within the dictionaries the precise relationship between a derivative data item and other defined items. This paper will describe a new prototype dictionary language, StarDDL, which uses text method attributes to specify precise relational expressions for linking defined data items. The expressions are written in a customised symbolic scripting language called dREL [2].

A StarDDL dictionary may be compiled into Java and used as a knowledge base attached to any CIF. Data may be then requested from the CIF, and will be returned either as those instantiated values on the file, or as values derived from the dictionary methods. This knowledge base approach may be applied to either the derivation of new values, or the validation of existing values, for any scientific discipline.

A prototype core CIF dictionary written in StarDDL will be used to demonstrate the automatic evaluation of crystallographic quantities such as molecular geometry and structure factors.

[1] www.iucr.org/iucr-top/cif/#dics
[2] Spadaccini, N., Hall, S.R., and Castleden, I.R. (2000) J. Chem Inf. Comp. Sc. 40, 1289-1301.