File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/95/p95-1033_concl.xml

Size: 1,616 bytes

Last Modified: 2025-10-06 13:57:28

<?xml version="1.0" standalone="yes"?>
<Paper uid="P95-1033">
  <Title>An Algorithm for Simultaneously Bracketing Parallel Texts by Aligning Words</Title>
  <Section position="9" start_page="250" end_page="250" type="concl">
    <SectionTitle>
7 Conclusion
</SectionTitle>
    <Paragraph position="0"> We have proposed a new tool for the corpus linguist's arsenal: a method for simultaneously bracketing both halves of a parallel bilingual corpus, using only a word translation lexicon. The method can also be seen as a word alignment algorithm that employs a realistic distortion model and aligns consituents as well as words.</Paragraph>
    <Paragraph position="1"> The basis of the approach is a new inversion-invariant transduction grammar formalism.</Paragraph>
    <Paragraph position="2"> Various extension strategies for simultaneous segmentation, positional biases, punctuation constraints, singleton rebalancing, and bracket flattening have been introduced. Parallel bracketing exploits a relatively untapped source of constraints, in that parallel bilingual sentences are used to mutually analyze each other. The model nonetheless retains a high degree of compatibility with more conventional monolingual formalisms and methods.</Paragraph>
    <Paragraph position="3"> The bracketing and alignment of parallel corpora can be fully automatized with zero initial knowledge resources, with the aid of automatic procedures for learning word translation lexicons. This is particularly valuable for work on languages for which online knowledge resources are relatively scarce compared with English.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML