Hello! I am writing this as a short update on the progress of my dictionary project. As of today I have a series of programs capable of translating the GNTDICT into a TEI conforming format. A short sample of the final format is included after this note. The programs seem to perform smoothly overall, although they are not intelligent enough to take care of inconsistencies in the original dictionary. The change in size of the two formats is significant, but not unexpected. When transforming the sample, it changed from about 17.5KB to about 81.5KB. I am currently working on a parser/compiler for the TEI dictionary. When completed it should be able to read in a TEI conforming dictionary, transform it to a compact form, and build tables to allow fast retrieval of the desired entry. I am hoping to make significant progress on this next phase of the project by the end of the Fall semester. Thanks for your time and attention. Steve ----------------------------(ENCLOSURES)------------------------------- <entry id=1> <form> <orth><gphrase lang=greek>A<\orth> <\form> <sense> <sn>1<\sn> <descrip> <\gphrase><gloss>alpha <\gloss>(first letter of the Greek alphabet) <\descrip> <\sense> <sense> <sn>2<\sn> <descrip> <gloss>first <\gloss>(in titles of NT writings) <\descrip> <\sense> <\entry> <entry id=2> <form> <orth><gphrase lang=greek>*)AARW/N<\orth> <\form> <pos> <\gphrase>m <\pos> <sense> <sn>1<\sn> <descrip> <gloss>Aaron <\descrip> <\sense> <\gloss><\entry> <entry id=3> <form> <orth><gphrase lang=greek>*)ABADDW/N<\orth> <\form> <pos> <\gphrase>m <\pos> <sense> <sn>1<\sn> <descrip> <gloss>Abaddon, Destroyer <\gloss>(Hebrew name of a demon transliterated int <\descrip> <\sense> <\entry> <entry id=4> <form> <orth><gphrase lang=greek>A)BARH/S<\orth> <affix>E/S<\affix> <\form> <sense> <sn>1<\sn> <descrip> <\gphrase><gloss>of no <\gloss>(financial) <gloss>burden <\descrip> <\sense> <\gloss><\entry> <entry id=5> <form> <orth><gphrase lang=greek>ABBA<\orth> <\form> <pos> <\gphrase>m <\pos> <sense> <sn>1<\sn> <descrip> <gloss>Father <\gloss>(of address to God) (Aramaic word) <\descrip> <\sense> <\entry> <entry id=6> <form> <orth><gphrase lang=greek>*)/ABEL<\orth> <\form> <pos> <\gphrase>m <\pos> <sense> <sn>1<\sn> <descrip> <gloss>Abel <\descrip> <\sense> <\gloss><\entry> <entry id=7> <form> <orth><gphrase lang=greek>*)ABIA/<\orth> <\form> <pos> <\gphrase>m <\pos> <sense> <sn>1<\sn> <descrip> <gloss>Abijah: <\gloss>(1) person in the genealogy of Jesus (Mt 1.7) <\descrip> <\sense> <sense> <sn>2<\sn> <descrip> (2) founder of a tribe of priests (Lk 1.5) <\descrip> <\sense> <\entry> |
There is something wrong with the first entry's balancing.
i.e., the GPHRASE tag doesn't nest inside the ORTH and FORM tags, but instead ends after the DESCRIP begins? Is this an automated conversion program bug? It is illegal SGML. <entry id=1> <form> <orth><gphrase lang=greek>A<\orth> <\form> <sense> <sn>1<\sn> <descrip> <\gphrase><gloss>alpha <\gloss>(first letter of the Greek alphabet) <\descrip> <\sense> <sense> <sn>2<\sn> <descrip> <gloss>first <\gloss>(in titles of NT writings) <\descrip> <\sense> <\entry> |
Free forum by Nabble | Edit this page |