File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/06/w06-1204_intro.xml
Size: 1,191 bytes
Last Modified: 2025-10-06 14:03:56
<?xml version="1.0" standalone="yes"?> <Paper uid="W06-1204"> <Title>Using Information about Multi-word Expressions for the Word-Alignment Task</Title> <Section position="4" start_page="20" end_page="20" type="intro"> <SectionTitle> 3 Behavior of MWEs in parallel corpora </SectionTitle> <Paragraph position="0"> In this section, we will briefly discuss the complexity of the alignment problem based on the verb based MWE's. From the word aligned sentence pairs, we compute the fraction of times a source sentence verb and its dependent are aligned together with the same word in the target language sentence. We count the number of times a source sentence verb and its dependent are aligned together with the same word in the target language sentence, and divide it by the total number of dependents. The total size of our word aligned corpus is 400 sentence pairs which includes both training and test sentences. The total number of dependents present in these sentences are 2209. Total number of verb dependent pairs which aligned with same word in target language are 193. Hence, the percentage of such occurrences is 9%, which is a significant number.</Paragraph> </Section> class="xml-element"></Paper>