File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/93/w93-0301_abstr.xml

Size: 1,269 bytes

Last Modified: 2025-10-06 13:47:54

<?xml version="1.0" standalone="yes"?>
<Paper uid="W93-0301">
  <Title>Robust Bilingual Word Alignment for Machine Aided Translation</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We have developed a new program called word_align for aligning parallel text, text such as the Canadian Hansards that are available in two or more languages. The program takes the output of char_align (Church, 1993), a robust alternative to sentence-based alignment programs, and applies word-level constraints using a version of Brown el al.'s Model 2 (Brown et al., 1993), modified and extended to deal with robustness issues. Word_align was tested on a subset of Canadian Hansards supplied by Simard (Simard et al., 1992). The combination of word_align plus char_align reduces the variance (average square error) by a factor of 5 over char_align alone. More importantly, because word_align and char_align were designed to work robustly on texts that are smaller and more noisy than the Hansards, it has been possible to successfully deploy the programs at AT&amp;T Language Line Services, a commercial translation service, to help them with difficult terminology.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML