File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/04/w04-0847_intro.xml

Size: 799 bytes

Last Modified: 2025-10-06 14:02:36

<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-0847">
  <Title>Optimizing Feature Set for Chinese Word Sense Disambiguation</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> This article describes the implementation of I2R word sense disambiguation system (I2R !WSD) that participated in one senseval3 task: Chinese lexical sample task. Our core algorithm is a supervised Naive Bayes classifier. This classifier utilizes an optimal feature set, which is determined by maximizing the cross validated accuracy of NB classifier on training data. The optimal feature set includes part-of-speech with position information in local context, and bag of words in topical context.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML