File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/96/c96-2193_metho.xml
Size: 4,085 bytes
Last Modified: 2025-10-06 14:14:22
<?xml version="1.0" standalone="yes"?> <Paper uid="C96-2193"> <Title>Senses of Polysemous Nouns: Building a Computational Lexicon of Basic Japanese Nouns</Title> <Section position="4" start_page="0" end_page="1082" type="metho"> <SectionTitle> 2 Structure of IPAL-BN </SectionTitle> <Paragraph position="0"> Each lexical entry is conlposed of orthographic information, idiomatic information, and subentries. Tile idiomatic in\]ormation accommodates the idiomatic or proverbial uses of the noun that have to be treated separately. Let us take an example, Hanako-wa hana-ga takai. This Japanese sentence has the idiomatic meaning, &quot;Hanako is proud,&quot; in addition to the ordinary nleaning, &quot;Hanako has a long nose.&quot; hi the idionlatic cases, the meaning of tile whole sentence cannot be decomposed into the meaning of each word. Thus we reserve the idiomatic information separately fi'om ordinary meaning sections.</Paragraph> <Paragraph position="1"> Then we introduce a hierarchy, subdividing each entry that has more than one usage of the word. Each usage is (:ailed a subentry. The subdivision to subentry is based not only on semantic but also on syntactic characteristics. This categorization process can be ilhtstrated with an exampie of hankyo 'echo': hant:y5 q~('he' 01. a somM t,hat is reflected off a surface such as tit(: wall of a lmilding.</Paragraph> <Paragraph position="2"> Bbru-no oto-ga hcya-no natca-de hankyS-suru.</Paragraph> <Paragraph position="3"> (The sound of the ball echoes in the room.) 02. one's (,xpr(:ssion of an opinion about or at.tit.mh: toward som('thing.</Paragraph> <Paragraph position="4"> Kon, o hanashi-wa hitobito-uo kokoro-ni hwnky5o yobiokosu darS.</Paragraph> <Paragraph position="5"> (This story will arouse an echo in every man's heart) One may note that hanky&O1 has a usage in which a noun t)ecomes a verb when followed by &quot;-suru&quot;, while hanky&02 does not. On the basis of tlfis (lifference, we divide this i20,112 into these two subentries.</Paragraph> <Paragraph position="6"> Figure 2 giw.'s an overview of each subc, ntry. A subentry consists of subentry information and several pieces of semantic property information. The subentry information contains syntaeti(:, semantic, and morphoh)gical information co,ninon to all parts of the subentry (each selnantie prop-erty ilfforlnation section). The semantic t)roperty information in(hides syntactic and semantic information. In the case of hanky5, we use the syntactic information in the subentry information to describe, tim difference in the usages (Kuwahata, In addition, we examine the sul)entries in more detail and introdu(:e the concept, of the ast)ects of nouns. \[,'or example, 'the letter' in 'I read the letter' focuses oi1 tim information in the letter, whereas its counterpart in 'I l>urned the letter' focuses on the thing (i.e., piece of paper) bearing that information. Since we can say 'I burned the letter that I had read', the word 'letter' does not have two meanings but rather has two aspects.</Paragraph> <Paragraph position="7"> Tiros a noun is consi<lere<l here to have various aspects depending on the predicates used in the sentence containing the noun. These aspects are called semantic properties (Aoyama, 1995). Instead of subdividing the lexical entry into multiple subentries, we categorized the regular collocations in each subentry in terms of semantic properties. For exan2ple, let, us take the word ha 'tooth /teeth' which has three semantic t)rot)erties. In our notation, the semantic properties are labeled by three letters in square brackets. The phrases ha-o migaku 'l)rush one's teeth' and ha-o nuku 'pull one's tooth' refer to tooth as a concrete object \[CON\]. And the phrases ha-ga haern 'cut teetlf and ha-ga nukeru 'h)se teeth' imply (natural) phenomena \[PHE\], while the t)hrases ha-ga jobu-da 'tlave sound teetlF and ha-ga guragurasuru 'a tooth feels h)ose' single out a condition of teeth from their potential conditions \[POT\].</Paragraph> </Section> class="xml-element"></Paper>