<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-0409">
  <Title>Integrating Morphology with Multi-word Expression Processing in Turkish</Title>
  <Section position="6" start_page="0" end_page="0" type="concl">
    <SectionTitle>
5 Conclusions
</SectionTitle>
    <Paragraph position="0"> This paper has described a multi-word expression extraction system for Turkish that handles various types of multi-word expressions, such as semi-lexicalized and non-lexicalized collocations, which depend on the recognition of certain morphological patterns across tokens. Our results indicate that with about 1100 rules (most of which were extracted from a large training corpus by searching for patterns involving a small set of support verbs), we were able to get almost 100% precision and around 60% recall on a small test corpus. We expect that additional rules from dictionaries and other sources will improve recall significantly.</Paragraph>
  </Section>
</Paper>