<?xml version="1.0" standalone="yes"?>
<Paper uid="W99-0633">
  <Title>IMPROVING BRILL'S POS TAGGER FOR AN AGGLUTINATIVE LANGUAGE</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0">In this paper Brill's rule-based PoS tagger is tested and adapted for Hungarian. It is shown that the present system does not achieve as high an accuracy for Hungarian as it does for English (and other Germanic languages) because of the structural differences between these languages. Hungarian, unlike English, has a rich morphology, is agglutinative with some inflectional characteristics, and has a fairly free word order. The tagger has the greatest difficulty with parts of speech belonging to open classes because of their complicated morphological structure. It is shown that tagging accuracy can be increased from approximately 83% to 97% simply by changing the rule-generating mechanisms, namely the lexical templates in the lexical training module.</Paragraph>
  </Section>
</Paper>