File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/89/p89-1012_abstr.xml
Size: 1,431 bytes
Last Modified: 2025-10-06 13:46:47
<?xml version="1.0" standalone="yes"?> <Paper uid="P89-1012"> <Title>DICTIONARIES, DICTIONARY GRAMMARS AND DICTIONARY ENTRY PARSING</Title> <Section position="2" start_page="0" end_page="0" type="abstr"> <SectionTitle> ABSTRACT </SectionTitle> <Paragraph position="0"> We identify two complementary p.ro.cesses in. the conversion of machine-readable dmUonanes into lexical databases: recovery of the dictionary structure from the typographical markings which persist on the dictionary distribution tapes and embody the publishers' notational conventions; followed by making explicit all of the codified and ellided information packed into individual entries.</Paragraph> <Paragraph position="1"> We discuss notational conventions and tape formats, outline structural properties of dictionaries, observe a range of representational phenomena particularly relevant to dictionary parsing, and derive a set of minimal requirements for a dictionary grammar formalism. We present a general purpose dictionary entry parser which uses a formal notation designed to describe the structure of entries and performs a mapping from the flat character stream on the tape to a highly structured and fully instantiated representation of the dictionary. We demonstrate the power of the formalism by drawing examples from a range of dictionary sources which have been processedand converted into lexical databases.</Paragraph> </Section> class="xml-element"></Paper>