File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/04/w04-0409_concl.xml
Size: 962 bytes
Last Modified: 2025-10-06 13:54:11
<?xml version="1.0" standalone="yes"?> <Paper uid="W04-0409"> <Title>Integrating Morphology with Multi-word Expression Processing in Turkish</Title> <Section position="6" start_page="0" end_page="0" type="concl"> <SectionTitle> 5 Conclusions </SectionTitle> <Paragraph position="0"> This paper has described a multi-word expression extraction system for Turkish that handles various types of multi-word expressions, such as semi-lexicalized and non-lexicalized collocations, which depend on the recognition of certain morphological patterns across tokens. Our results indicate that with about 1100 rules (most of which were extracted from a large training corpus by searching for patterns involving a certain small set of support verbs), we were able to get almost 100% precision and around 60% recall on a small test corpus. We expect that with additional rules from dictionaries and other sources we will improve recall significantly.</Paragraph> </Section> </Paper>