File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/06/w06-0134_concl.xml

Size: 982 bytes

Last Modified: 2025-10-06 13:55:31

<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-0134">
  <Title>Sydney, July 2006. c(c)2006 Association for Computational Linguistics A Pragmatic Chinese Word Segmentation System</Title>
  <Section position="6" start_page="191" end_page="191" type="concl">
    <SectionTitle>
5 Conclusion
</SectionTitle>
    <Paragraph position="0"> We have briefly described our word segmentation system and NER system. We use word-based features in the whole processing. Our system has a good performance in terms of R iv measure, so this means that the trigram model with the smoothing algorithm can deal with the basic segmentation task well. However, the result in the bakeoff indicates that detecting out-of-vocabulary word seems to be a harder task than dealing with the segmentation-ambiguity task.</Paragraph>
    <Paragraph position="1"> The work in the future will concentrate on two sides: improving the NER performance and adding New Word Detection Algorithm.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML