File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/06/w06-1627_concl.xml

Size: 2,444 bytes

Last Modified: 2025-10-06 13:55:36

<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-1627">
  <Title>Efficient Search for Inversion Transduction Grammar</Title>
  <Section position="8" start_page="230" end_page="230" type="concl">
    <SectionTitle>
6 Conclusion
</SectionTitle>
    <Paragraph position="0"> A* search for Viterbi alignment selection under ITG is ef cient using IBM Model 1 as an outside estimate. The experimental results indicate that despite being a more relaxed word-for-word alignment model than ITG, IBM Model 1 can serve as an ef cient and reliable approximation of ITG in terms of Viterbi alignment probability. This is more true when we apply Model 1 to both translation directions and take the minimum of both.</Paragraph>
    <Paragraph position="1"> We have also tried to incorporate estimates of binary rule probabilities to make the outside estimate even sharper. However, the further improvement was marginal.</Paragraph>
    <Paragraph position="2"> We are able to nd the ITG Viterbi translation using our A* decoding algorithm with an outside estimate that combines outside bigrams and translation probabilities for outside words. The hook trick gave us a signi cant further speedup; we believe this to be the rst demonstrated practical application of this technique.</Paragraph>
    <Paragraph position="3"> Interestingly, the BLEU score for the optimal translations under the probabilistic model is lower than we achieve with our best bigram-based system using pruning. However, this system makes use of the A* heuristic, and our speed/performance curve shows that the heuristic allows us to achieve higher BLEU scores with the same amount of computation. In the case of trigram integrated decoding, there is 1 point of BLEU score improvement by moving from a typical CYK plus beam search decoder to a decoder using A* plus beam search.</Paragraph>
    <Paragraph position="4"> However, without knowing what words will appear in the output language, a very sharp outside estimate to further bring down the number of combinations is dif cult to achieve.</Paragraph>
    <Paragraph position="5"> The brighter side of the move towards optimal decoding is that the A* search strategy leads us to the region of the search space that is close to the optimal result, where we can more easily nd good translations.</Paragraph>
    <Paragraph position="6"> Acknowledgments This work was supported by NSF ITR IIS-09325646 and NSF ITR IIS0428020. null</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML