File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/n06-2016_abstr.xml
Size: 1,143 bytes
Last Modified: 2025-10-06 13:44:54
<?xml version="1.0" standalone="yes"?> <Paper uid="N06-2016"> <Title>Investigating Cross-Language Speech Retrieval for a Spontaneous Conversational Speech Collection</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Cross-language retrieval of spontaneous speech combines the challenges of working with noisy automated transcription and language translation. The CLEF 2005 Cross-Language Speech Retrieval (CL-SR) task provides a standard test collection to investigate these challenges. We show that we can improve retrieval performance: by careful selection of the term weighting scheme; by decomposing automated transcripts into phonetic substrings to help ameliorate transcription errors; and by combining automatic transcriptions with manually-assigned metadata. We further show that topic translation with online machine translation resources yields effective CL-SR.</Paragraph> </Section> </Paper>