File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/03/w03-1729_intro.xml
Size: 1,016 bytes
Last Modified: 2025-10-06 14:02:08
<?xml version="1.0" standalone="yes"?> <Paper uid="W03-1729"> <Title>SYSTRAN's Chinese Word Segmentation</Title> <Section position="3" start_page="0" end_page="0" type="intro"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> SYSTRAN's Chinese word segmentation is one important component of its Chinese-English machine translation system. The Chinese word segmentation module uses a rule-based approach, based on a large dictionary and fine-grained linguistic rules. It works on general-purpose texts from different Chinese-speaking regions, with comparable performance. SYSTRAN participated in the four open tracks in the First</Paragraph> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> International Chinese Word Segmentation </SectionTitle> <Paragraph position="0"> Bakeoff. This paper gives a general description of the segmentation module, as well as the results and analysis of its performance in the Bakeoff.</Paragraph> </Section> </Section> class="xml-element"></Paper>