File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/98/p98-2198_abstr.xml

Size: 1,778 bytes

Last Modified: 2025-10-06 13:49:27

<?xml version="1.0" standalone="yes"?>
<Paper uid="P98-2198">
  <Title>Locating noun phrases with finite state transducers.</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We present a method for constructing, maintaining and consulting a database of proper nouns. We describe noun phrases composed of a proper noun and/or a description of a human occupation. They are formalized by finite state transducers (FST) and large coverage dictionaries and are applied to a corpus of newspapers. We take into account synonymy and hyperonymy. This first stage of our parsing procedure has a high degree of accuracy. We show how we can handle requests such as: 'Find all newspaper articles in a general corpus mentioning the French prime minister', or 'How is Mr. X referred to in the corpus; what have been his different occupations through out the period over which our corpus extends?' In the first case, non trivial occurrences of noun phrases are located, that is phrases not containing words present in the request~ but either synonyms, or proper nouns relevant to request. The results of the search is far better than than those obtained by a key-word based engine. Most answers are correct: except some cases of homonymy (where a human reader would also fail without more context). Also, the treatment of people having several different occupations is not fully resolved.</Paragraph>
    <Paragraph position="1"> We have built for French, a library of about one thousand such FSTs., and English FSTs arc under construction. The same method can be used to locate and propose new proper nouns, simply by replacing given proper names in the same FSTs by variables.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML