File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/85/e85-1025_concl.xml

Size: 2,732 bytes

Last Modified: 2025-10-06 13:56:04

<?xml version="1.0" standalone="yes"?>
<Paper uid="E85-1025">
  <Title>TOWARDS A DICTIONARY SUPPORT ENVIRONMENT FOR REALTIME PARSING ABSTRACT</Title>
  <Section position="7" start_page="176" end_page="176" type="concl">
    <SectionTitle>
CONCLUSION
</SectionTitle>
    <Paragraph position="0"> The research reported in this paper demonstrates that it is both possible and useful to restructure the information contained in LDOCE for use in natural language processing systems. Most applications for natural language processing systems will require vocabularies substantially larger than those typically developed for theoretical or demonstration purposes and it is often not practical, and certainly never desirable, to generate these by hand. The use of machine-readable sources of published dictionaries represents a practical and feasible alternative to hand generation.</Paragraph>
    <Paragraph position="1"> Clearly, there is much more work to be done with LDOCE in the extension of the use of grammar codes and the improvement of the word sense classification system. Similarly, there is a considerable amount of information in LDOCE which we have not attempted to exploit as yet; for example, the box codes, which contain selection restrictions for verbs or the subject codes, which classify word senses according to the Merriam-Webster codes for subject matter (see Walker &amp; Amsler (1983) for a suggested use for these). The large amount of semi-formalised information concerning the interpretation of noun compounds and idioms also represents a rich and potentially very useful source of information for natural language processing systems. In particular, we intend to investigate the automatic generation of phrasal analysis rules from the information on idiomatic word usage.</Paragraph>
    <Paragraph position="2"> In the longer term, it is clear that no existing published dictionary can meet all the requirements of a natural language processing system and a substantial component of the research reported above has been devoted to restructuring LDOCE to make it more suitable for automatic analysis. This suggests that the automatic construction of dictionaries from published sources intended for other purposes will have a limited life unless lexicography is heavily influenced by the requirements of automated natural language analysis. In the longer term, therefore, the automatic construction of dictionaries for natural language processing systems may need to be based on techniques for the automatic analysis of large corpora (eg. Leech et al., 1983). However, in the short term, the approach outlined in this paper will allow us to produce a sophisticated and useful dictionary rapidly.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML