File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/04/p04-1066_abstr.xml
Size: 699 bytes
Last Modified: 2025-10-06 13:43:40
<?xml version="1.0" standalone="yes"?> <Paper uid="P04-1066"> <Title>Improving IBM Word-Alignment Model 1</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We investigate a number of simple methods for improving the word-alignment accuracy of IBM Model 1. We demonstrate reduction in alignment error rate of approximately 30% resulting from (1) giving extra weight to the probability of alignment to the null word, (2) smoothing probability estimates for rare words, and (3) using a simple heuristic estimation method to initialize, or replace, EM training of model parameters.</Paragraph> </Section> class="xml-element"></Paper>