File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/04/w04-0406_abstr.xml

Size: 997 bytes

Last Modified: 2025-10-06 13:43:43

<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-0406">
  <Title>Multiword Expression Filtering for Building Knowledge Maps</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> This paper describes an algorithm that can be used to improve the quality of multiword expressions extracted from documents. We measure multiword expression quality by the &amp;quot;usefulness&amp;quot; of a multiword expression in helping ontologists build knowledge maps that allow users to search a large document corpus.</Paragraph>
    <Paragraph position="1"> Our stopword based algorithm takes n-grams extracted from documents, and cleans them up to make them more suitable for building knowledge maps. Running our algorithm on large corpora of documents has shown that it helps to increase the percentage of useful terms from 40% to 70% - with an eight-fold improvement observed in some cases.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML