File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/99/p99-1061_intro.xml

Size: 2,055 bytes

Last Modified: 2025-10-06 14:06:58

<?xml version="1.0" standalone="yes"?>
<Paper uid="P99-1061">
  <Title>A Bag of Useful Techniques for Efficient and Robust Parsing</Title>
  <Section position="4" start_page="0" end_page="473" type="intro">
    <SectionTitle>
2 Precompiling the Lexicon
</SectionTitle>
    <Paragraph position="0"> Lexicon entries in the development system are small templates that are loaded and expanded on demand by the typed feature structure system. Thereafter, all lexical rules are applied to the expanded feature structures. The results of these two computations form the input of the analysis stage.</Paragraph>
    <Paragraph position="1"> 1VERBMOBIL (Wahlster, 1993) deals with the translation of spontaneously spoken dialogues, where only a minor part consists of &amp;quot;sentences&amp;quot; in a linguistic sense. Current languages are English, German, and Japanese.</Paragraph>
    <Paragraph position="2"> Some of the methods were originally developed in the context of another HPSG environment, the LKB (Copestake, 1998). This lends support to our claims of their independence from a particular parser or grammar engine.  In order to save space and time in the run-time system, the expansion and the application of lexical rules is now done off-line. In addition, certain parts of the feature structure are deleted, since they are only needed to restrict the application of lexical rules (see also section 7 for a similar approach). For each stem, all results are stored in compact form as one compiled LISP file, which allows to access and load a requested entry rapidly with almost no restriction on the size of the lexicon. Although load time is small (see figure 1), the most frequently used entries are cached in main memory, reducing effort in the lexicon stage to a minimum.</Paragraph>
    <Paragraph position="3"> We continue to compute morphological information online, due to the significant increase of entries (a factor of 10 to 20 for German), which is not justifiable considering the minimal computation time for this operation.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML