<?xml version="1.0" standalone="yes"?>
<Paper uid="P99-1067">
  <Title>Automatic Identification of Word Translations from Unrelated English and German Corpora</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> Algorithms for the alignment of words in translated texts are well established. However, only recently have new approaches been proposed to identify word translations from non-parallel or even unrelated texts.</Paragraph>
    <Paragraph position="1"> This task is more difficult because most statistical clues useful in processing parallel texts cannot be applied to non-parallel texts. Whereas some studies on parallel texts have shown up to 99% of the word alignments to be correct, the accuracy for non-parallel texts has been around 30% up to now. The current study, which is based on the assumption that there is a correlation between the patterns of word co-occurrences in corpora of different languages, achieves a significant improvement, with about 72% of word translations identified correctly.</Paragraph>
  </Section>
</Paper>