<?xml version="1.0" standalone="yes"?>
<Paper uid="W03-1729">
  <Title>SYSTRAN's Chinese Word Segmentation</Title>
  <Section position="3" start_page="0" end_page="0" type="intro">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> SYSTRAN's Chinese word segmentation is an important component of its Chinese-English machine translation system. The Chinese word segmentation module uses a rule-based approach, based on a large dictionary and fine-grained linguistic rules. It works on general-purpose texts from different Chinese-speaking regions, with comparable performance across regions. SYSTRAN participated in the four open tracks in the First International Chinese Word Segmentation Bakeoff. This paper gives a general description of the segmentation module, as well as the results and analysis of its performance in the Bakeoff.</Paragraph>
  </Section>
</Paper>