File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/00/w00-1314_abstr.xml
Size: 879 bytes
Last Modified: 2025-10-06 13:41:55
<?xml version="1.0" standalone="yes"?> <Paper uid="W00-1314"> <Title>Word Alignment of English-Chinese Bilingual Corpus Based on Chunks</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> In this paper, a method for the word alignment of English-Chinese corpus based on chunks is proposed. The chunks of English sentences are identified firstly. Then the chunk boundaries of Chinese sentences are predicted by the translations of English chunks and heuristic information. The ambiguities of Chinese chunk boundaries are resolved by the coterminous words in English chunks. With the chunk aligned bilingual corpus, a translation relation probability is proposed to align words. Finally, we evaluate our system by real corpus and present the experiment results.</Paragraph> </Section> class="xml-element"></Paper>