File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/03/w03-0428_abstr.xml
Size: 861 bytes
Last Modified: 2025-10-06 13:43:01
<?xml version="1.0" standalone="yes"?> <Paper uid="W03-0428"> <Title>Named Entity Recognition with Character-Level Models</Title> <Section position="2" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We discuss two named-entity recognition models which use characters and character a4 -grams either exclusively or as an important part of their data representation. The first model is a character-level HMM with minimal context information, and the second model is a maximum-entropy conditional markov model with substantially richer context features. Our best model achieves an overall Fa5 of 86.07% on the English test data (92.31% on the development data). This number represents a 25% error reduction over the same model without word-internal (substring) features.</Paragraph> </Section> class="xml-element"></Paper>