File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/e06-1018_abstr.xml

Size: 1,274 bytes

Last Modified: 2025-10-06 13:44:42

<?xml version="1.0" standalone="yes"?>
<Paper uid="E06-1018">
  <Title>Word Sense Induction: Triplet-Based Clustering and Automatic Evaluation</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> In this paper a novel solution to automatic and unsupervised word sense induction (WSI) is introduced. It represents an instantiation of the 'one sense per collocation' observation (Gale et al., 1992). Like most existing approaches it utilizes clustering of word co-occurrences. This approach differs from other approaches to WSI in that it enhances the effect of the one sense per collocation observation by using triplets of words instead of pairs.</Paragraph>
    <Paragraph position="1"> The combination with a two-step clustering process using sentence co-occurrences as features allows for accurate results. Additionally, a novel and likewise automatic and unsupervised evaluation method inspired by Sch&amp;quot;utze's (1992) idea of evaluation of word sense disambiguation algorithms is employed. Offering advantages like reproducability and independency of a given biased gold standard it also enables automatic parameter optimization of the WSI algorithm.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML