File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/98/p98-1069_concl.xml

Size: 1,019 bytes

Last Modified: 2025-10-06 13:58:06

<?xml version="1.0" standalone="yes"?>
<Paper uid="P98-1069">
  <Title>An IR Approach for Translating New Words from Nonparallel, Comparable Texts</Title>
  <Section position="8" start_page="418" end_page="418" type="concl">
    <SectionTitle>
11 Conclusions
</SectionTitle>
    <Paragraph position="0"> We have devised an algorithm using context seed word TF/IDF for extracting bilingual lexicon from nonparallel, comparable corpus in English-Chinese. This algorithm takes into account the reliability of bilingual seed words and is language independent. This algorithm can be applied to other language pairs such as English-French or English-German. In these cases, since the languages are more similar linguistically and the seed word lexicon is more reliable, the algorithm should yield better results. This algorithm can also be applied in an iterative fashion where high-ranking bilingual word pairs can be added to the seed word list, which in turn can yield more new bilingual word pairs.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML