File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/05/i05-6011_metho.xml

Size: 25,556 bytes

Last Modified: 2025-10-06 14:09:40

<?xml version="1.0" standalone="yes"?>
<Paper uid="I05-6011">
  <Title>Annotating Honorifics Denoting Social Ranking of Referents</Title>
  <Section position="2" start_page="0" end_page="91" type="metho">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> To varying extents, languages have ways to reflect the speaker's deference towards the addressee and people being referred to in utterances (c.f. Brown and Levinson 1987): by adopting a more polite air or tone of voice, avoiding coarse language, and modifying one's choice of specific vocabulary. This is prominent in Asian languages, Japanese and Korean in particular, which exhibit an extensive use of honorifics (respectful words).</Paragraph>
    <Paragraph position="1"> Morphologically for example, French has a choice of the familiar tu and the formal vous (a third person plural) for the second person referent. Similarly Greek has the same choice: esei and eseis respectively. European languages commonly project one's deference by the use of different personal pronouns and titles (e.g. Mr., Dr., and Hon.).</Paragraph>
    <Paragraph position="2"> Japanese and Korean, on the other hand, have numerous ways to say 'I' or 'you' calibrated by social position, age, gender and other factors. The projection of honorifics extends over the vocabulary of verbs, adjectives, and nouns as well as sentence structures, to elevate a person or humble oneself. (1) and (2) below from Japanese are such examples, which use honorific verbs instead of neutral forms kuru 'come', iku 'go/accompany', and motomeru 'seek':  notorious problem of zero pronoun resolution in Japanese, where the subject and the object of a sentence are frequently left unexpressed (Nakaiwa 2002, Nariyama 2003, inter alia). It is clear from the examples that coding the honorific relations of referents provides vital information for identifying what zero pronouns refer to; namely, to know whether or not a predicate denotes disparity of social rank between referents and to identify the rank of the referents. This is what this paper proposes to do. Siegel (2000) reported that 23.9% of Japanese zero pronouns in task-oriented dialogues can be resolved using information gleaned from honorification.</Paragraph>
    <Paragraph position="3"> Coding of honorifics also improves machine translation outputs into Japanese in choosing the correct predicate depending on the relationship of the referents. Inappropriate use of honorifics, in particular the use of the plain form where an honorific form should be used, is rude and can be offensive.</Paragraph>
    <Paragraph position="4">  Section 2 reviews some earlier work on this topic in NLP; Section 3 elaborates on honorifics; Section 4 formulates the ranking factors; Section 5 proposes a way to assign ranks to referents; Section 6 discusses a way to calibrate rankings of referents, as ranks are relative to the ranks of other referents in the sentence; Section 7 describes our annotation scheme, and finally conclusion in Section 8.</Paragraph>
  </Section>
  <Section position="3" start_page="91" end_page="91" type="metho">
    <SectionTitle>
2 Earlier studies
</SectionTitle>
    <Paragraph position="0"> The Japanese honorification system has been studied extensively in linguistics, particularly in sociolinguistics. Because of its importance and frequent use in the Japanese language, there has been some related work in NLP; within the framework of grammar formalism, GPSG by Ikeya (1983), JPSG by Gunji (1987), and more recently HPSG by Siegel (2000); work from a view point of resolving zero pronouns in dialogues by Dohsaka (1990).</Paragraph>
    <Paragraph position="1"> Of these, the most thorough work on Japanese honorification is seen in JACY, a Japanese HPSG grammar (Siegel 2000, Siegel and Bender 2002). It extends the BACKGR (owe - honour) relation (Pollard and Sag 1994), which accounts only for subject honorifics, to accommodate the other types of honorification used in Japanese (see Section 3.1 for the types). The full account of the Japanese honorification system requires syntactic and pragmatic information in many dimensions, with more input from the latter, the gathering of which is an extremely convoluted task. This paper builds on the basics from JACY and complements it in two ways to extend the JACY annotation presented in Section 7.</Paragraph>
    <Paragraph position="2">  1. Ranking referents in social hierarchy 2. Calibrating the ranks  Regarding 1, honorifics tell which referent is higher in rank, so each referent must be assigned a rank to make use of honorific information. This is crucial when generating sentences to assign appropriate forms of honorific nouns and predicates in machine translation output into Japanese. In processing, ranking referents is not usually of importance when referents are overt, but it is when referents are zero pronouns. The identification of zero pronouns relies heavily on the honorific information conveyed in the predicates.</Paragraph>
    <Paragraph position="3"> Regarding 2, social rank is not absolute, but relative, so that the same referent may be higher or lower depending on which referent it appears with in a sentence. For example, the president of a company is socially regarded as ranked higher than the managers, who are in turn higher than clerks, but this rank is outweighed when their clients come in the sentence, in which case the president is ranked lower than their clients.</Paragraph>
  </Section>
  <Section position="4" start_page="91" end_page="94" type="metho">
    <SectionTitle>
3 Honorifics
</SectionTitle>
    <Paragraph position="0"> Honorifics is a term used to represent words that convey esteem or respect. Extensive studies on Japanese honorification revealed many forms of honorifics in use. The use of honorification is mandatory in many social situation. 1 Hence, every sentence can be viewed as coded for honorification if we consider the lack of an honorific marking as a sign that there is no hierarchical difference between referents.</Paragraph>
    <Paragraph position="1"> Types of honorifics that indicate who is shown respect are described in Subsection 3.1, and forms of honorifics in Subsection 3.2.</Paragraph>
    <Section position="1" start_page="91" end_page="92" type="sub_section">
      <SectionTitle>
3.1 Types of honorifics
</SectionTitle>
      <Paragraph position="0"> Honorifics in modern (post-war) Japanese are generally classified into the following three categories, depending on who is shown respect (Martin 1964, Matsumoto 1997, Nariyama 2003, inter alia). The first two types are often referred to as 'propositional (referential) honorifics'.</Paragraph>
      <Paragraph position="1"> i. Subject honorifics (called Sonkeigo in Japanese): to elevate or show respect towards the subject of a sentence ii. Non-subject honorifics (called Humility, or Kenjogo): to humble oneself by showing respect to the non-subject referent, generally the object iii. Addressee honorifics ('polite', Teineigo): to show respect towards the listener Note that the expressions of deference are by nature made essentially with human referents, i.e. between an honouree and an honourer.</Paragraph>
      <Paragraph position="2">  However, paying respect often extends to things and events related to the honouree in Japanese. This is often expressed with an honorific prefix o- or go- to the nouns. For example, the passage (3) is used by train conductors for ticket inspection. The use of non-subject honorific form means that the unexpressed subject (i.e. the train conductor) is showing respect towards the tickets, which belong to or have some relation to his honourees (i.e. the passengers).</Paragraph>
      <Paragraph position="4"> '(An honourer) is going to inspect (his honourees') tickets.' .notdef.g0001'Let me inspect your tickets, please.'</Paragraph>
    </Section>
    <Section position="2" start_page="92" end_page="94" type="sub_section">
      <SectionTitle>
3.2 Forms of honorifics
</SectionTitle>
      <Paragraph position="0"> Honorifics in Japanese take various forms that are reflected in the word forms, either in lexical choice or in inflections - verbs in Subsection 3.2.1, adjectives and nouns in Subsection 3.2.2, and also sentence structures in Subsection 3.2.3.</Paragraph>
      <Paragraph position="1">  There are five ways of expressing referent honorification in verbs, depending on the type of verb and the level of respect that is intended.2 Types 3 displays the highest deference, and 3&gt;2&gt;1&gt;4 in descending order. Type 5 displays a formality rather than deference towards the referent. The larger the gap in the hierarchy, the more disparity of referents in rank we expect.</Paragraph>
      <Paragraph position="2"> 3.2.1.1 Type 1: Alternation of verb forms Verbs can be transformed into subject honorific (SubH) and non-subject honorific (NsubH) structures as follows: SubH: o + verb stem + ni naru ('become').</Paragraph>
      <Paragraph position="3"> NsubH: o + verb stem + suru ('do').</Paragraph>
      <Paragraph position="4"> 2 According to Wenger (1983:283-292), 70% of verbs have Subject honorific forms, while only 36% of verbs have non-subject honorific forms. He explains why not all verbs have forms of honorification, although he does not explain why there are fewer non-subject honorific forms.</Paragraph>
      <Paragraph position="5"> Honorification cannot occur, 1) unless the subject is human; this explains why there are no honorific forms for verbs such as kooru 'freeze' and hoeru 'bark'; and 2) on verbs that have negative connotations, such as kuiarasu 'eat greedily'.</Paragraph>
      <Paragraph position="6"> Accordingly, machi 'wait', for example, can be turned into two different forms of honorifics:  (4a) O-machi-ni naru. [SubH] '(An honouree) waits (for someone/something).' (4b) O-machi-suru. [NsubH]  '(An honourer) waits (for an honouree).' The honorific prefix o- can be go-, as shown below. Basically, o- is used for Japanese native verbs and nouns and go- for Sino-Japanese (Chinese originated) words.</Paragraph>
      <Paragraph position="7">  (5a) Go-shichaku-ni naru. [SubH] '(An honouree) tries on (clothes).' (5b) Go-hookoku-suru [NsubH] '(An honourer) reports (to an honouree).' 3.2.1.2 Type 2: Suppletive forms Different lexical items are used for some (more frequently used) verbs. For example, the following examples all mean 'o eat': (6a) Taberu. [non-honorific: neutral] '(Someone) eats.' (6b) Meshiagaru. [SubH]  '(An honouree) eats.' (6c) Itadaku. [NsubH] '(An honourer) eats.' Table 1 shows some examples of other suppletive forms of honorification.</Paragraph>
      <Paragraph position="8"> Neutral SubH NsubH do suru nasaru itasu exist/stay iru irassharu/ o-ide-ni-naru oru go iku irassharu/ mairu/ o-ide-ni-naru ukagau come kuru irassharu/ o-ide-ni-naru mairu say iu ossharu moosu eat/drink taberu/nomu meshiagaru itadaku  Notice that some honorific forms are shared by very different meanings of verbs. For instance, irassharu can mean either 'come', 'go', or 'stay'. The nature of honorification is said to be indirect in expression. This semantic neutralization poses problems in machine translation outputs from Japanese.</Paragraph>
      <Paragraph position="9">  3.2.1.3 Type 3: Combination of Types 1 &amp; 2 This usage is restricted to some verbs, for example: (7) O-meshiagari-ni naru.</Paragraph>
      <Paragraph position="10"> '(Someone highly respected) eats.' 3.2.1.4 Type 4: Use of passive form -rare The passive -rare is suffixed to the verb stem to display subject honorifics instead of the passive interpretation; for example: (8) Tabe-rare-ru.</Paragraph>
      <Paragraph position="11"> '(An honouree) eats.' c.f. (6a) Taberu. [non-honorific: neutral] '(Someone) eats.' Note that there are no corresponding constructions of Types 3 and 4 for non-subject honorific forms.</Paragraph>
      <Paragraph position="12"> 3.2.1.5 Type 5: Lexical semantics The semantics of some verbs give rise to referential restrictions, in that the subject must be higher or lower than the non-subject referent. This has been neglected in previous studies of honorification. Analogous to the example in (1), insotsu$&amp;quot;'take' has a restricted usage as '(a higher ranked person) leads (a group of lower ranked people).' We used Lexeed (Bond et al. 2004) - a manually built self-contained lexicon, to extract verbs and verbal nouns with such referential restrictions. It consists of words and their definitions for the most familiar 28,000 words in Japanese, as measured by native speakers. This set is formulated to cover the most basic words, which cover 72.2% of the words in a typical Japanese newspaper. Since honorification tends to be found more in sophisticated words than in basic words, we used those extracted verbs as seeds to expand the list using the Goi-Taikei thesaurus (Ikehara et al. 1997).</Paragraph>
      <Paragraph position="13"> For example, the semantic class meirei 'command' (Class Number 1824) lists synonyms, such as iitsukeru07D~'tell', and shiji&amp;_'instruct', all of which exhibit the same referential restriction: a high ranked person as the subject and a low ranked person as the object. However, this is not always the case. For instance, kyoka0'permit' (Class Number 1735) includes as its synonyms dooi'agree' and sansei2.:'agree/approve' that do not exhibit the same referential restriction as kyoka. We manually extracted from Lexeed 698 such verbs (397 of these are 'a higher ranked person does to a lower person' and the rest 301 are the reverse), and from Goi-Taikei further 429 (228, 201 respectively), 1127 in total.</Paragraph>
      <Paragraph position="14">  Honorification is also expressed on nouns (including verbal nouns) and adjectives by the honorific prefix o- with variants on-, go-, and mi. Honorific prefixes have four functions:  [1] An entity/action belongs to the honouree. [2] An entity/action has an implication to the honouree, even when it belongs to the speaker.3 [3] Addressee honorifics to show formality of speech/politeness to the addressee.4 [4] Conventional usage5  The use of the honorific particles in [1] provides important information on the type of referents. Possessors are seldom expressed and there are no definite/indefinite articles in Japanese (Bond 2005), but honorific particles can take on these functions. For example, onimotsu (honorable luggage) means 'your/his/... luggage', and go-ryokoo means 'your/his/... trip'. Although the exact identity of the honoreepossessors is context dependent, as the following minimal pair of sentences show, in (9a) the possessors can never be the speaker or the speaker's in-group member (see Subsection 4.2 for 'in-group'), as indicted by *. In contrast, the identity of the subject, as in (9b) without an honorific particle, is generally the speaker or his in-group member.</Paragraph>
      <Paragraph position="15"> 3 For example, o-tegami (literally, 'honourable letter') is used when the letter is something to do with the honouree; it could be the letter that the honouree wrote, a letter sent by someone else to the honouree, or a letter written by the speaker to the honouree.</Paragraph>
      <Paragraph position="16"> 4 For example, o-hana (flowers) and o-shokuji (meal) are such cases where possession is not a concern. 5 The standard example of this type is go-han 'honourablerice' meaning 'rice/meal'. Such honorific particles do not convey honorifics, but are seen as part of set phrases.  Honorification is manifested also in the choice of sentence structure. The causative construction can be used only when the causer is superior in social hierarchy to the causee, as shown in (10). If the causee is equal to or superior over the causer, the benefactive construction is used, conveying the same proposition with the connotation that the causee has accepted the causer's request instead of command, as in (11). Thus, the sentence structure reveals the referential disparity in rank.</Paragraph>
      <Paragraph position="18"> '(I requested my teacher to read the book for me, and) my teacher read the book for me.'</Paragraph>
    </Section>
  </Section>
  <Section position="5" start_page="94" end_page="96" type="metho">
    <SectionTitle>
4 Ranking factors
</SectionTitle>
    <Paragraph position="0"> Section 3 explained the various forms that indicate disparity of referents in rank. This section describes three factors that induce such disparity in rank: Social hierarchy, in-group and out-group distinction, and unfamiliarity of the addressee.</Paragraph>
    <Section position="1" start_page="94" end_page="94" type="sub_section">
      <SectionTitle>
4.1 Social hierarchy
</SectionTitle>
      <Paragraph position="0"> Social hierarchy is the core rank-inducing factor, which can be overridden by the other two factors. It refers to social ranks in such social settings as company, school, family, as well as general age/generational rank. For example, an employer is perceived as ranked higher than his employees, and a teacher is higher than his students, and the older a person is, the higher he is ranked.</Paragraph>
      <Paragraph position="1"> Social hierarchy functions similar to the Subject-Verb agreement in terms of person, number and gender seen in many European languages. Although Japanese has no syntactic coding of such a S-V agreement, verbs agree with the referential relation of the subject and other referents in terms of social hierarchy (the same view is held by Pollard and Sag 1994).</Paragraph>
    </Section>
    <Section position="2" start_page="94" end_page="95" type="sub_section">
      <SectionTitle>
4.2 In-group and out-group distinction
</SectionTitle>
      <Paragraph position="0"> Referents are also classified according to the in-group and out-group distinction, depending on the social relation among three parties: the speaker, the addressee, and the people being referred to.6 For example, in (12) an officer of a company (the speaker) talks about the president of his company (referent) to his boss (addressee). The officer is ranked lower than the president and his boss, and accordingly the subject honorific and addressee honorific ('Polite') are used. However in (13), when he reports the same proposition to people outside the company, the president is regarded as a 'in-group' member to the speaker, and therefore the description of him uses the non-subject honorific form of verb, the same as the speaker would use to describe himself. In other words, the rank assigned from social hierarchy is overridden by the in-group and out-group distinction.</Paragraph>
      <Paragraph position="1">  Thus, the dichotomy of in-group/out-group distinction is relative. This is prominently seen in the use of family terms, as shown in Table 2. When someone talks to her/his mother or about her with her/his family, 'mother' is referred to as  6 Generally, the type of honorific use is also determined by the three parties. To be more precise, setting and bystander also play a part in determining the type of honorifics to be used (Brown and Levinson 1987).</Paragraph>
      <Paragraph position="2">  okaasan, using the out-group (OG) form, while when talking about her to outsiders, she is referred to as haha, using the in-group (IG) form.</Paragraph>
      <Paragraph position="3"> There are three lexical types that reflect the in-group and out-group distinction.</Paragraph>
      <Paragraph position="4"> 1) the deictic prefixes: too- hon-, hei-, setu, etc. for the in-group use, translated into English as 'my/our', and ki-, o-, on-, etc. for the out-group use, translated as 'your/his/her/their'. For example, too-koo (my/our school) versus kikoo (your/their school).</Paragraph>
      <Paragraph position="5"> 2) the suffixes -san/-sama/-dono: for instance, gakusei 'a student' is referred to as gakusei-san out of deference to a respected out-group person (e.g. 'your students', 'student of your school').</Paragraph>
      <Paragraph position="6"> 3) suppletive forms: some examples are shown in Table 2.</Paragraph>
    </Section>
    <Section position="3" start_page="95" end_page="96" type="sub_section">
      <SectionTitle>
4.3 Unfamiliarity of the addressee
</SectionTitle>
      <Paragraph position="0"> In apparent absence of disparity in social ranking and age, honorifics can still be used subjectively in formal settings, when communicating with unfamiliar people, particularly by female speakers.</Paragraph>
      <Paragraph position="1"> 5 Assigning referents with ranks using Goi-Taikei thesaurus In order to make use of honorific information, each referent must be assigned with a rank to determine which referent in a sentence is ranked the highest. We use Goi-Taikei for this assignment (Ikehara et al. 1997). It has a semantic feature tree with over 3,000 nodes of semantic classes organised with a maximum of 12 levels (see Figure 1). It includes in its semantic classes information on occupational status, generation, family composition; the sort of information needed for this assignment.</Paragraph>
      <Paragraph position="2">  In addition, the following two tasks are required: 1) Group some semantic classes together from different nodes.</Paragraph>
      <Paragraph position="3"> For the honorific use, some semantic classes that are scattered over different nodes in the tree should be grouped together. For instance, the information relevant to Social hierarchy is found not only under Occupation (status) but also under Organization, Family, and so forth.</Paragraph>
      <Paragraph position="4"> 2) Rank the semantic classes where relevant. Figure 2 is a preliminary result showing ranks of referents in selected semantic classes, noted as class names followed by their semantic class numbers in Goi-Taikei listed in ascending order.</Paragraph>
      <Paragraph position="5">  This list needs to be expanded. As the list is taken exhaustively from Goi-Taikei, these entries must be augmented with other thesauri, organisation charts, genealogical trees, and other ways as well as by hand.</Paragraph>
    </Section>
  </Section>
  <Section position="6" start_page="96" end_page="97" type="metho">
    <SectionTitle>
6 Calibration of ranks
</SectionTitle>
    <Paragraph position="0"> Ranks of referents are not absolute, but relative to the other referents in the sentence. For example, an adult referent is ranked higher than a youth, but the same referent is ranked lower when appearing with an elderly in the same sentence. Similarly, manager in a company is higher than workers with no title, but the same manager is lower than the company president.</Paragraph>
    <Paragraph position="1"> Thus, the calibration of rankings is necessary.</Paragraph>
    <Paragraph position="2"> However, calibrating ranks while capturing relative ranks is an extremely complicated task, as any combination of referents and ranking categories can appear in a sentence as well as the fact that one referent may belong to multiple categories. For example, a measure has to be taken in case one referent is ranked in one string (e.g. 'minister') than the other referent (e.g. 'clerk') in the same sentence, but he is lower in another string (e.g. 'less experienced' in the profession or younger in age) (see Figure 2); or when the in-group and out-group distinction takes the precedence in the form of honorifics, for instance a referent is senior than the other referent, but he is an in-group member to the speaker.</Paragraph>
    <Paragraph position="3"> More complicated still, within the same class, there may exist a disparity in rank. For example, the age difference, even by one year, can determine the use of honorifics, so that honorifics is used between two referents under the same class adult.</Paragraph>
    <Paragraph position="4"> Considering the above, we propose the following calibration scheme as an initial step of dealing with the complex phenomena of honorifics.</Paragraph>
    <Paragraph position="5"> [1] create referential links, example modules of which are suggested in Figure 3. Each string of ranks in Figure 2 constitutes a module, which is connected to another module. Figure 3 shows that a referent 'JOHN' is a student as well as a child of his parents that is depicted. JOHN belongs to other modules of strings; he may be an elder bother at home, and may be a senior student at school, each of which is a member of a module and is connected to other modules.</Paragraph>
    <Paragraph position="6"> Connections between two modules may be more than one, for example, 'grandparent' may be a teacher of 'teacher' of JOHN.</Paragraph>
    <Paragraph position="7"> It is necessary to identify as many modules as identifiable and to link them in order to accurately determine the ranks of each referent for a sentence.</Paragraph>
    <Paragraph position="8">  [2] a diagram for calibrating ranks Figure 4 is proposed to capture the mechanisms of honorifics that determine the ranks of referents for a sentence.</Paragraph>
    <Paragraph position="9"> The referential links are the first (core) rank determining factor. When one referent belongs to multiple strings, for instance, a string from Social hierarchy and another from Age, then the former takes the higher rank, which is noted as 'Social &gt; Age'. The case where two referents belong to the same class but still appear with honorifics is due to the subtle difference in rank, noted as 'The same class'.</Paragraph>
    <Paragraph position="10"> These ranks assigned by the referential links can be overridden by 'in/out group precedence', which is determined by the type of modules, as shown in Figure 3.</Paragraph>
    <Paragraph position="11"> The use of honorifics in absence of disparity in social rank is interpreted as lack of familiarity of the addressee.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML