File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/03/w03-1716_intro.xml

Size: 2,560 bytes

Last Modified: 2025-10-06 14:02:07

<?xml version="1.0" standalone="yes"?>
<Paper uid="W03-1716">
  <Title>The Semantic Knowledge-base of Contemporary Chinese and its Applications in WSD [?]</Title>
  <Section position="4" start_page="0" end_page="0" type="intro">
    <SectionTitle>
2 Outline of SKCC
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
2.1 Scale and Structure
</SectionTitle>
      <Paragraph position="0"> SKCC consists of one general database and six sub-databases. The general database contains shared attributes of all the 66,539 entries, while the sub-databases provide detailed descriptions of the distinctive semantic attributes associated with the parts of speech (POS). For example, the verb data-base has 16 attribute fields, noun database and adjective database has 15 attribute fields respectively.  All of the six sub-databases can be linked to the general database through four key fields, namely ENTRY, POS, HOMOMORPHISM and SENSE. As a result, the son knots can inherit all information from their father knots (Figure 1).</Paragraph>
      <Paragraph position="1"> Figure 1 Main structure of SKCC</Paragraph>
    </Section>
    <Section position="2" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
2.2 Semantic Hierarchy
</SectionTitle>
      <Paragraph position="0"> One of the most outstanding characteristics of SKCC is that its semantic hierarchy is based on grammatical analysis, rather than merely on general knowledge (as illustrated in Figure 2 below).</Paragraph>
      <Paragraph position="1"> This classification system represents the latest progress in Chinese semantics. It is very useful for NLP applications(Zhan Weidong, 1997), as well as compatible with various semantic resources, such as Wordnet (Christiane Fellbaum. 1998), Chinese concept dictionary (CCD)( Yu Jiangsheng, 2002), HowNet(Dong Zhendong, 2000) etc. Currently, the classification of all of the 66,539 entries has already been completed.</Paragraph>
    </Section>
    <Section position="3" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
2.3 Comprehensive Semantic Descriptions
</SectionTitle>
      <Paragraph position="0"> There is close correlation between lexical meaning and its distribution. Oriented to MT and IR, one aim of SKCC is to provide detailed semantic description and collocation behavior that in many cases is likely to be uniquely associated with a single sense. For example, following attribute fields have been filled with values in the verb database</Paragraph>
    </Section>
  </Section>
class="xml-element"></Paper>
Download Original XML