File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/p05-1074_abstr.xml
Size: 1,141 bytes
Last Modified: 2025-10-06 13:44:25
<?xml version="1.0" standalone="yes"?> <Paper uid="P05-1074"> <Title>Paraphrasing with Bilingual Parallel Corpora</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Previous work has used monolingual parallel corpora to extract and generate paraphrases. We show that this task can be done using bilingual parallel corpora, a much more commonly available resource.</Paragraph> <Paragraph position="1"> Using alignment techniques from phrase-based statistical machine translation, we show how paraphrases in one language can be identified using a phrase in another language as a pivot. We define a paraphrase probability that allows paraphrases extracted from a bilingual parallel corpus to be ranked using translation probabilities, and show how it can be refined to take contextual information into account.</Paragraph> <Paragraph position="2"> We evaluate our paraphrase extraction and ranking methods using a set of manual word alignments, and contrast the quality with paraphrases extracted from automatic alignments.</Paragraph> </Section> class="xml-element"></Paper>