File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/p05-1074_abstr.xml

Size: 1,141 bytes

Last Modified: 2025-10-06 13:44:25

<?xml version="1.0" standalone="yes"?>
<Paper uid="P05-1074">
  <Title>Paraphrasing with Bilingual Parallel Corpora</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> Previous work has used monolingual parallel corpora to extract and generate paraphrases. We show that this task can be done using bilingual parallel corpora, a much more commonly available resource.</Paragraph>
    <Paragraph position="1"> Using alignment techniques from phrase-based statistical machine translation, we show how paraphrases in one language can be identified using a phrase in another language as a pivot. We define a paraphrase probability that allows paraphrases extracted from a bilingual parallel corpus to be ranked using translation probabilities, and show how it can be refined to take contextual information into account.</Paragraph>
    <Paragraph position="2"> We evaluate our paraphrase extraction and ranking methods using a set of manual word alignments, and contrast the quality with paraphrases extracted from automatic alignments.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML