File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/98/p98-2161_concl.xml

Size: 2,637 bytes

Last Modified: 2025-10-06 13:58:09

<?xml version="1.0" standalone="yes"?>
<Paper uid="P98-2161">
  <Title>Integration of Large-Scale Linguistic Resources in a Natural Language Understanding System</Title>
  <Section position="6" start_page="981" end_page="982" type="concl">
    <SectionTitle>
5 Evaluation
</SectionTitle>
    <Paragraph position="0"> We analyzed a small corpus of 1330 sentences (on the subject of our NLU system) in order to give a quantitative description of the contribution of our lexicon and semantics servers. Our corpus contained forms of 526 distinct roots. Over 60% of these roots had definitions in our core vocabulary.</Paragraph>
    <Paragraph position="1"> Definitions for an additional 25% were extracted from the lexicon server. Analysis of the remaining 71 roots showed that a developer would have needed to enter definitions for 20 common nouns, 2 verbs, and 2 adjectives; the rest were truly proper nouns as assigned by default. The 24 roots not  covered were for the most part instances of technical jargon for our domain? For the 215 verbs in our corpus, again over 60% had semantic rules in our core NL Engine. Our semantics server contributed rules for an additional 38%, leaving our developer with the need to write rules (or rely on guessed default rules) for only 2 verbs. These results are summarized in Table 1.</Paragraph>
    <Paragraph position="2"> Thus, in this application the servers would have enabled the developer to avoid creating 132 lexical entries and 82 semantic rules. In addition, the default mechanism would have eliminated the need for manual entry of 47 more lexical entries.</Paragraph>
    <Paragraph position="3"> in core in server  We have successfully integrated diverse large-scale linguistic resources, both externally and internally compiled, using a client-server architecture, for use with a general-purpose natural language understanding system. The conversion of resources such as Comlex and WordNet into a format usable by our system was straightforward, and the resulting complex of resources executes without any performance problems in a multi-user environment. The task of a developer of a particular natural language application is greatly simplified by the presence of these resources.</Paragraph>
    <Paragraph position="4"> In the future we plan to incorporate WordNet information for verbs into our KB server, and to add semantics rules for the remaining Comlex verbs into the semantics server. We also expect to augment the semantics server with semantic class constraints on the fillers of roles such as agent, and to create a fifth server, containing selection constraints.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML