File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/99/e99-1010_intro.xml

Size: 865 bytes

Last Modified: 2025-10-06 14:06:53

<?xml version="1.0" standalone="yes"?>
<Paper uid="E99-1010">
  <Title>An Efficient Method for Determining Bilingual Word Classes</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> In statistical natural language processing we always face the problem of sparse data. One way to reduce this problem is to group words into equivalence classes which is a standard method in statistical language modeling. In this paper we describe a method to determine bilingual word classes suitable for statistical machine translation. We develop an optimization criterion based on a maximum-likelihood approach and describe a clustering algorithm. We will show that the usage of the bilingual word classes we get can improve statistical machine translation. null</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML