File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/00/p00-1015_concl.xml
Size: 1,721 bytes
Last Modified: 2025-10-06 13:52:50
<?xml version="1.0" standalone="yes"?> <Paper uid="P00-1015"> <Title>A Unified Statistical Model for the Identification of English BaseNP</Title> <Section position="7" start_page="121" end_page="121" type="concl"> <SectionTitle> 5 Conclusions </SectionTitle> <Paragraph position="0"> This paper presented a unified statistical model to identify baseNP in English text. Compared with other methods, our approach has following characteristics: (1) baseNP identification is implemented in two related stages: N-best POS taggings are first determined, then baseNPs are identified given the N best POS-sequences. Unlike other approaches that use POS tagging as preprocessing, our approach is not dependant on perfect POS-tagging, Moreover, we can apply baseNP information to further increase the precision of POS tagging can be improved.</Paragraph> <Paragraph position="1"> These experiments triggered an interesting future research challenge: how to cluster certain baseNP rules into certain identifiers so as to improve the precision of both baseNP and POS tagging. This is one of our further research topics.</Paragraph> <Paragraph position="2"> (2) Our statistical model makes use of more lexical information than other approaches. Every word in the sentence is taken into account during baseNP identification.</Paragraph> <Paragraph position="3"> (3) Viterbi algorithm is applied to make global search at the sentence level.</Paragraph> <Paragraph position="4"> Experiment with the same testing data used by the other methods showed that the precision is 92.3% and the recall is 93.2%. To our knowledge, these results are comparable with or better than all previously reported results.</Paragraph> </Section> class="xml-element"></Paper>