File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/03/w03-1508_abstr.xml

Size: 1,050 bytes

Last Modified: 2025-10-06 13:43:13

<?xml version="1.0" standalone="yes"?>
<Paper uid="W03-1508">
  <Title>Transliteration of Proper Names in Cross-Lingual Information Retrieval</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We address the problem of transliterating English names using Chinese orthography in support of cross-lingual speech and text processing applications. We demonstrate the application of statistical machine translation techniques to &amp;quot;translate&amp;quot; the phonemic representation of an English name, obtained by using an automatic text-to-speech system, to a sequence of initials and finals, commonly used sub-word units of pronunciation for Chinese.</Paragraph>
    <Paragraph position="1"> We then use another statistical translation model to map the initial/final sequence to Chinese characters. We also present an evaluation of this module in retrieval of Mandarin spoken documents from the TDT corpus using English text queries.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML