File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/00/w00-1323_concl.xml
Size: 871 bytes
Last Modified: 2025-10-06 13:52:58
<?xml version="1.0" standalone="yes"?> <Paper uid="W00-1323"> <Title>Combining Lexical and Formatting Cues for Named Entity Acquisition from the Web</Title> <Section position="9" start_page="188" end_page="188" type="concl"> <SectionTitle> 6 Conclusion </SectionTitle> <Paragraph position="0"> This study is another application that demonstrates the usability of the WWW as a resource for NLP (see, for instance, (Grefenstette, 1999) for an application of using WWW frequencies in selecting translations).</Paragraph> <Paragraph position="1"> It also confirms the interest of non-textual linguistic features, such as formatting markups, inNLP for structured documents such as Web pages. Further work on Web-based NE acquisition could take advantage of machine learning techniques as used for wrapper induction (Kushmerick et al., 1997).</Paragraph> </Section> class="xml-element"></Paper>