File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/02/w02-1211_concl.xml

Size: 1,527 bytes

Last Modified: 2025-10-06 13:53:25

<?xml version="1.0" standalone="yes"?>
<Paper uid="W02-1211">
  <Title>Constructing of a Large-Scale Chinese-English Parallel Corpus</Title>
  <Section position="5" start_page="0" end_page="0" type="concl">
    <SectionTitle>
6 Conclusion &amp; Further Prospects
</SectionTitle>
    <Paragraph position="0"> In this paper, we introduce the developing project, that is, the constructing of a large-scale (above 500,000 pair sentences) Chinese-English parallel corpus. The current status of Chinese corpora is overviewed with the emphasis on parallel corpus.</Paragraph>
    <Paragraph position="1"> The XML coding principles for Chinese-English parallel corpus are discussed. The sentence alignment algorithm used in this project is described with a computer-aided checking processing in order to processes the noise Chinese-English parallel resource.. We show the design of the bilingual concordance for the parallel corpus, also.</Paragraph>
    <Paragraph position="2"> As a beginning project, there is still much room for further development. The parallel resource is relative rare, so the new ways, such as, data exchange with other researcher institute and translation company, should be launched to obtain more parallel resource which can be used to research society. The coding principle should be adjusted in real work. A coding rule in more detail should form in near future. We also intend to add the option for recommendation the correspondence translation word for input keywords in concordance tool.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML