<?xml version="1.0" standalone="yes"?> <Paper uid="W00-1323"> <Title>Combining Lexical and Formatting Cues for Named Entity Acquisition from the Web</Title> <Section position="2" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Because named entities (NEs) are constantly renewed, fresh ones must be acquired from recent text sources. We present a tool for the acquisition and typing of NEs from the Web that combines a harvester with three parallel shallow parsers, each dedicated to a specific structure (lists, enumerations, and anchors).</Paragraph> <Paragraph position="1"> The parsers combine lexical cues such as discourse markers with formatting instructions (HTML tags) to analyze enumerations and their associated initializers.</Paragraph> </Section> </Paper>