<?xml version="1.0" standalone="yes"?>
<Paper uid="W00-1323">
  <Title>Combining Lexical and Formatting Cues for Named Entity Acquisition from the Web</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> Because named entities (NEs) are constantly renewed, fresh ones must be acquired from recent text sources. We present a tool for acquiring and typing NEs from the Web that combines a harvester with three parallel shallow parsers, each dedicated to a specific structure (lists, enumerations, and anchors).</Paragraph>
    <Paragraph position="1"> The parsers combine lexical cues, such as discourse markers, with formatting instructions (HTML tags) to analyze enumerations and their associated initializers.</Paragraph>
  </Section>
</Paper>