File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/05/h05-1113_intro.xml
Size: 1,145 bytes
Last Modified: 2025-10-06 14:02:58
<?xml version="1.0" standalone="yes"?> <Paper uid="H05-1113"> <Title>Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), pages 899-906, Vancouver, October 2005. c(c)2005 Association for Computational Linguistics Measuring the relative compositionality of verb-noun (V-N) collocations by integrating features</Title> <Section position="4" start_page="899" end_page="899" type="intro"> <SectionTitle> 2 Basic Architecture </SectionTitle> <Paragraph position="0"> Every V-N collocation is represented as a vector of features which are composed largely of various statistical measures. The values of these features for the V-N collocations are extracted from the British National Corpus. For example, the V-N collocation 'raise an eyebrow' can be represented as a0 Frequency = 271, Mutual Information = 8.43, Distributed frequency of object = 1456.29, etc.a1 . A SVM based ranking function uses these features to rank the V-N collocations based on their relative compositionality. These ranks are then compared with the human ranking.</Paragraph> </Section> class="xml-element"></Paper>