File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/evalu/93/w93-0312_evalu.xml

Size: 1,494 bytes

Last Modified: 2025-10-06 14:00:17

<?xml version="1.0" standalone="yes"?>
<Paper uid="W93-0312">
  <Title>Example-Based Sense Tagging of Running Chinese Text Xiang Tong Chang-ning Huang</Title>
  <Section position="7" start_page="107" end_page="108" type="evalu">
    <SectionTitle>
5. Limitations and Future Work
</SectionTitle>
    <Paragraph position="0"> a. The system makes errors when the segmentation of the input texts is less than correct. The performance of the current sense tagger can be improved if more sophisticated segmentation method is adopted.</Paragraph>
    <Paragraph position="1"> b. Although the reasoning process takes advantage of collocational information within the phrase in which the untagged segment is a part, there is no guarantee that the phrase does not have multiple meanings. When such cases occur, the result of the reasoning is subject to chance.</Paragraph>
    <Paragraph position="2"> c. The example-based sense tagging method works quite well with content words, but for function words it often makes faulty guesses. This is partly due to the fact that function words are less sensitive to context. The current system assigns a default sense number for most function words. However, for those words which can both be a function word and a content word, the system often makes errors. This kind of errors decreases when the system preprocesses the input texts with a stochastic Chinese grammatical tagger like the one developed at Tsinghua University (Bai, et al. , 1992).</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML