<?xml version="1.0" standalone="yes"?>
<Paper uid="N06-2016">
  <Title>Investigating Cross-Language Speech Retrieval for a Spontaneous Conversational Speech Collection</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> Cross-language retrieval of spontaneous speech combines the challenges of working with noisy automated transcription and language translation. The CLEF 2005 Cross-Language Speech Retrieval (CL-SR) task provides a standard test collection to investigate these challenges. We show that we can improve retrieval performance: by careful selection of the term weighting scheme; by decomposing automated transcripts into phonetic substrings to help ameliorate transcription errors; and by combining automatic transcriptions with manually assigned metadata. We further show that topic translation with online machine translation resources yields effective CL-SR.</Paragraph>
  </Section>
</Paper>