File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/evalu/06/w06-3106_evalu.xml
Size: 1,333 bytes
Last Modified: 2025-10-06 13:59:58
<?xml version="1.0" standalone="yes"?> <Paper uid="W06-3106"> <Title>Phrase-Based SMT with Shallow Tree-Phrases</Title> <Section position="7" start_page="44" end_page="44" type="evalu"> <SectionTitle> 4.4 Results </SectionTitle> <Paragraph position="0"> The scores for the 16 slices of the test corpus are reported in Table 2. TP-ENGINE shows slightly better figures for all metrics.</Paragraph> <Paragraph position="1"> For each system and for each metric, we had 16 scores (from each of the 16 slices of the test corpus)andwerethereforeabletotestthestatisticalsig- null nicance of the difference between the TP-ENGINE and PP-ENGINE using a Wilcoxon signed-rank test for paired samples. This test showed that the difference observed between the two systems is significant at the 95% probability level for BLEU and significant at the 99% level for WER and SER.</Paragraph> <Paragraph position="2"> (+-value range) of the translations produced by the two engines on a test set of 16 disjoint corpora of 500 sentences each. The figures reported are percentages. null On the DEV corpus, we measured that, on average,eachsourcesentenceiscoveredby39TPs(their null source part, naturally), yielding a source coverage of approximately70%. Incontrast, theaveragenumber of covering PPs per sentence is 233.</Paragraph> </Section> class="xml-element"></Paper>