File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/95/p95-1027_abstr.xml

Size: 1,472 bytes

Last Modified: 2025-10-06 13:48:29

<?xml version="1.0" standalone="yes"?>
<Paper uid="P95-1027">
  <Title>A Quantitative Evaluation of Linguistic Tests for the Automatic Prediction of Semantic Markedness</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We present a corpus-based study of methods that have been proposed in the linguistics literature for selecting the semantically unmarked term out of a pair of antonymous adjectives.</Paragraph>
    <Paragraph position="1"> Solutions to this problem are applicable to the more general task of selecting the positive term from the pair. Using automatically collected data, the accuracy and applicability of each method is quantified, and a statistical analysis of the significance of the results is performed.</Paragraph>
    <Paragraph position="2"> We show that some simple methods are indeed good indicators for the answer to the problem while other proposed methods fail to perform better than would be attributable to chance.</Paragraph>
    <Paragraph position="3"> In addition, one of the simplest methods, text frequency, dominates all others. We also apply two generic statistical learning methods for combining the indications of the individual methods, and compare their performance to the simple methods. The most sophisticated complex learning method offers a small, but statistically significant, improvement over the original tests.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML