File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/99/p99-1067_abstr.xml
Size: 1,140 bytes
Last Modified: 2025-10-06 13:49:52
<?xml version="1.0" standalone="yes"?> <Paper uid="P99-1067"> <Title>Automatic Identification of Word Translations from Unrelated English and German Corpora</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Algorithms for the alignment of words in translated texts are well established. However, only recently have new approaches been proposed to identify word translations from non-parallel or even unrelated texts.</Paragraph> <Paragraph position="1"> This task is more difficult because most statistical clues useful in the processing of parallel texts cannot be applied to non-parallel texts. Whereas some studies have shown up to 99% of word alignments to be correct for parallel texts, the accuracy for non-parallel texts has been around 30% up to now. The current study, which is based on the assumption that there is a correlation between the patterns of word co-occurrences in corpora of different languages, achieves a significant improvement, with about 72% of word translations identified correctly.</Paragraph> </Section> </Paper>