Version 1 (modified by 15 years ago) (diff) | ,
---|
VisDic XML format
XML files in VisDic consist of tags and their values. Value of the TAG tag is enclosed in strings <TAG> and </TAG>. Tags can be nested, which means that each tag can contain another tag. White characters like spaces, tabs and new-lines at the start or at the end of each tag value are trimmed. However, XML files parsed by VisDic are quite different from the common ones in these points:
- XML dictionaries contain entries. Each entry is in fact represented by one small XML file. There is no tag enclosing the whole dictionary.
- XML tag has no attributes.
Example of one Wordnet synset:
<SYNSET> <ID>ENG21-00001740-n</ID> <POS>n</POS> <SYNONYM> <LITERAL>entity<SENSE>1</SENSE></LITERAL> </SYNONYM> <DEF>that which is perceived or known or inferred to have its own distinct existence (living or nonliving)</DEF> <BCS>2</BCS> <DOMAIN>factotum</DOMAIN> </SYNSET>