File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/99/e99-1010_intro.xml
Size: 865 bytes
Last Modified: 2025-10-06 14:06:53
<?xml version="1.0" standalone="yes"?> <Paper uid="E99-1010"> <Title>An Efficient Method for Determining Bilingual Word Classes</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> In statistical natural language processing we always face the problem of sparse data. One way to reduce this problem is to group words into equivalence classes which is a standard method in statistical language modeling. In this paper we describe a method to determine bilingual word classes suitable for statistical machine translation. We develop an optimization criterion based on a maximum-likelihood approach and describe a clustering algorithm. We will show that the usage of the bilingual word classes we get can improve statistical machine translation. null</Paragraph> </Section> class="xml-element"></Paper>