File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/w06-0134_abstr.xml

Size: 884 bytes

Last Modified: 2025-10-06 13:45:18

<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-0134">
  <Title>Sydney, July 2006. c(c)2006 Association for Computational Linguistics A Pragmatic Chinese Word Segmentation System</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> This paper presents our work for participation in the Third International Chinese Word Segmentation Bakeoff. We apply several processing approaches according to the corresponding sub-tasks, which are exhibited in real natural language. In our system, Trigram model with smoothing algorithm is the core module in word segmentation, and Maximum Entropy model is the basic model in Named Entity Recognition task. The experiment indicates that this system achieves F-measure 96.8% in MSRA open test in the third SIGHAN-2006 bakeoff.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML