File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/82/c82-1017_metho.xml
Size: 17,075 bytes
Last Modified: 2025-10-06 14:11:23
<?xml version="1.0" standalone="yes"?> <Paper uid="C82-1017"> <Title>ON THE ROLE OF THE HIERARCHY OF ACTIVATION IN THE PROCESS OF NATURAL LANGUAGE UNDERSTANDING</Title> <Section position="1" start_page="0" end_page="107" type="metho"> <SectionTitle> ON THE ROLE OF THE HIERARCHY OF ACTIVATION IN THE PROCESS OF NATURAL LANGUAGE UNDERSTANDING </SectionTitle> <Paragraph position="0"/> <Section position="1" start_page="0" end_page="107" type="sub_section"> <SectionTitle> Czechoslovakia </SectionTitle> <Paragraph position="0"> The elements of the stock of knowledge shared .by the speaker and the hearer change their salience, in the sense of being immediately accessible in the bearer's memory. The hierarchy of salience is argued to be a basic component of a mechanism serving for the identification of reference. Some of the regularities of this mechanism are discussed, the description of which is a necessary prerequisite of an automatic understanding of connected texts.</Paragraph> <Paragraph position="1"> i. When discussing the question of what is the main contribution of computational linguistics to theoretical linguistics at one of the panel discussions at Coling 80 in Tokyo, the panelists unanimously pointed to the due emphasis laid on the function of natural language in communication. In this direction, it is known from the classical Prague School that the topic/focus articulation belongs to the most important aspects connecting the structure of language with the conditions of its use. However, this articulation I was not examined systematically in most of the main linguistic trends, so that the pioneers who are aware of its importance for natural language understanding (esp. Grosz, 1977, Reichman, 1978, 1981, McKeown, 1979, Hirst, 1981) attempt to find their own ways.</Paragraph> <Paragraph position="2"> During the discourse the stock of &quot;knowledge&quot; the speaker assumes to share with the hearer changes according to what is &quot;in the centre of attention&quot; at the given time point. 2 Each utterance has its influence on this hierarchy of salience; however, not every mentioning of an object has the same effect. The assumption that the degrees of salience constitute a partial ordering is corroborated e.g. by the degrees of consciousness characterized by Chafe (1974).</Paragraph> <Paragraph position="3"> Let us denote by x ~ an expression x referring to an object a that is salient to the -qa- degree n in the stock; since the maximum of salience can more easily than other degrees be imagined to be fixed, we denote it by ~ = ~, reversing the direction of&quot;growth&quot; of the degrees; to the left (right) of the arrow we indicate the state immediately preceding (following) the utterance of a sentence S in which x occurs; x c F (T) stands for &quot;x belongs to the focus (~opic) of S&quot;;--P(x ) denotes that x is expressed either by a weak pronoun or Ts d~le~ed in S, though--present in its tectogrammatical represent~ ation (TR);3 with--NP. _(x ) we refer to definite NP's and to such _--~a~e a NP's as one of the N, ~o~ (of the) N.</Paragraph> <Paragraph position="4"> According to a preliminary empirical investigation it appears that:the following rules concerning the degrees of salience obtain 108 E. HAJI~OVA and J. VRBOVA (cf. a first formulation of some of them in Sgall, 1980a, p.240), where m, n > ~: (i) If P(X_a), then a_n- + a_ n(2) If NP(x a) C \[, then a_ n- / a~ (3) If NPdef(~a) C ~, then a_n-- + 1 (4) If a_n-- + a_ m-, then b_ m-+2 obtains for every object b that is not itself referred to in the TR of S, but is immediately associated with it (as e.g. teacher or pupil are with schoo l, or parents with child, e--t-~..) (5) If As for xa, or C0noernin~ x a is present in S, then a ~ + a I ~ (6) If ~a is not included in the TR of S, not even among the associated objects (see Rule (4)), then a ~ + a n-+2 As was pointed out by Sgall (1967a, p.95f.; 198Ob), the difference between n = ~ and n = 1 is too small to make the reference assignment un\[vocal <if--such specific pronouns as Czech ten, German der or Engl. the latter are not present): If Charles met PAUL is f-o\[lowed by He wore his HAT, we may only guess whether he refers to Paul and his to Charles, or vice w~rsa, or whether a third possibility is present. However, whenever the difference in the degrees of salience is greater, an occurrence of a pronoun or of a definite NP in the next utterance regularly refers - as we want to illustrate in the sequel - to that object which was referred to as the most salient among those that meet the lexical and grammatical conditions of the NP or pronoun <number, gender). The two main types of the thematic progressions described by Dane~ (1968; 1976) are important here. Also Hobbs (1976) specifies a common heuristics for resolving pronouns, which says that we should prefer the subject to an NP in the object position (in other words, since the subject position in English functions as a rule as (a part of) the topic of the sentence, this heuristics says that the tJpic is preferably preserved in successive sentences). Another possible strategy <Hobbs, 1979) includes inferencing and factual knowledge.</Paragraph> <Paragraph position="5"> In our rules some of the relationships between the salience and the pragmatic relationships of lexical units are treated only in a preliminary way. Thus e.g. Rule (4) or a similar rule should reflect also a rather common situation when a mentioning of a particular ob- null ject (or objects) brings into foreground only a fraction of a set of objects that already are in the fore~round of the stock of knowledge. It does not capture the case when the foregrounded object(s) belong to a set the activation of which has already faded away under the threshold of activation; it seems to be a plausible hypothesis that even in such a case the &quot;rest&quot; of the objects are being activated beyond this threshol~ by mentioning the foregrounded object(s), even if to a very low extent.</Paragraph> <Paragraph position="6"> 2. Let us illustrate the above rules on a fragment of text; we are not concerned with a systematic linguistic analysis of the given text here, but we coBcentrate on the changes of activation of the elements of the stock of shared knowledge as reflected (and induced3 by the discourse. We restrict ourselves to objects identified by NP's or pronouns and we leave aside actions and their circumstantial</Paragraph> </Section> </Section> <Section position="2" start_page="107" end_page="107" type="metho"> <SectionTitle> ON THE ROLE OF HIERARCHY OF ACTIVATION 109 </SectionTitle> <Paragraph position="0"> qualifications. The capital letters denote the intonation centre of each clause; for the ease of reference, the clauses are numbered.</Paragraph> <Paragraph position="1"> A partial preliminary discussion of such a text was presented in Haji~ov~ and Vrbov~ (1982).</Paragraph> <Paragraph position="2"> The following comments are numbered according to the number of the clause to which they refer. We omit those clauses the commentary to which follows trivially from the previous comments and from the rules. \[I\] In the opening sentence of the text two objects are introduced: the school garden and the children. Since the latter object is mentioned in the focus of the sentence, it becomes more salient than objects mentioned previously or those activated by their presence in the broader context or situation. Also the associated objects such as school, pupils, and teachers are now relatively salient.</Paragraph> <Paragraph position="3"> \[2a\] The children as a highly salient object in the stock of shared knowledge are referred to in the topic position by a pronoun, which keeps their activation at the same degree (see Rule El)), while the activation of the other objects <Rule (6))fades away.</Paragraph> <Paragraph position="4"> \[2b\] A new object enters the scene: the teachers. However, their appearance on the scene is not completely new and surprising, because the hearer has already been introduced into the scene of school, which has the teachers associated with it (see Rule <4)). This is why the teachers can be understood as referring to the teachers of the children and the school which were already mentioned. In accord-</Paragraph> </Section> <Section position="3" start_page="107" end_page="107" type="metho"> <SectionTitle> ON THE ROLE OF HIERARCHY OF ACTIVATION 111 </SectionTitle> <Paragraph position="0"> ance with Rule (3) the degree of salience of teachers is I. The children remain on the scene - they are reminded by a pronominal reference in the topic and their activation does not fade away (see Rule <i)). It should be noted that even though the pronoun them is in the final position of the sentence, it is preceded by the intonation centre; it is well known that all participants placed after the intonation centre belong to the topic part of the sentence.</Paragraph> <Paragraph position="1"> \[2c\] The pronoun in the topic has an indistinct reference: both teachers and children may be referred to. This indistinctness of reference is difficult to solve: according to the strategies mentioned in Sect.</Paragraph> <Paragraph position="2"> 2 the two objects are salient enough to be referred to by a weak pronoun, and the teachers as well as the children can be excited in the given situation, so that not even factual knowledge with inferencing gives a safe clue. The formulation of the text can be characterized as not successful in this point. However, if the lexical setting of \[2c\] were changed, as in \[2c ~\] and \[2c''\], the indistinctness would disappear, and only one of the two possible candidates of reference would be chosen, namely the teachers in \[2c'\] and the children \[2c''\]: \[2c'\] ... because they were always kind to their pupils.</Paragraph> <Paragraph position="3"> \[2c''\]... because they were too small to understand the seriousness,..</Paragraph> <Paragraph position="4"> \[3\] A new object is being introduced in the focus of the sentence; again, it can be said that it is not completely new, because it is also given by the broader scene <of family relations: children - parents).</Paragraph> <Paragraph position="5"> This newly introduced item receives a very high degree of activation, while the other two objects - the children and the teachers - are not mentioned in this sentence at all and thus their activation fades away (see Rule <6)).</Paragraph> <Paragraph position="6"> \[4\] The speaker chooses a subset of the most activated set of elements and speaks about this subset; he introduces a new object on the scene, which receives a high degree of activation (see Rule <2)), but since it does not appear in any of the following utterances, this activation later fades away step by step <Rule (6)>. The complement to the set of five parents - the rest of the pa~ents - retains the degree of a rather high activation, see Rule (3).</Paragraph> <Paragraph position="7"> \[6\] One of the items the activation of which has been fading away <the teachers) is back on the scene; the teachers may be referred to by a definite NP in the topic since the activation has not been lowered under a certain threshold, which would otherwise lead to an explicit reintroduction.</Paragraph> <Paragraph position="8"> \[~9\] An explicit reintroduction of the children is preferred by the speaker since the children have not been mentioned in several preceding utterances; their activation has been lowered considerably. For such strong re-activation expressions such as concernin@, as far as ... is concerned, as for ... are used.</Paragraph> <Paragraph position="9"> \[I0\] Only one subset of the children is explicitly mentioned in this sentence; the activation of this item of knowledge is raised <Rule <3)), but also the complement subset of pupils is &quot;supported&quot; in its activation <Rule <i)), which does not fade away as it would be the case if no pupils were menuioned at all <Rule <6)).</Paragraph> <Paragraph position="10"> \[llb\] The reference of the pronoun ~ is again indistinct; it may refer either to the smallest or to the tall pupils. Neither of the strategies mentioned above helps: both the NP s the tall <pupils) and the smallest <pupils) are in the topic position of \[lla\], and both are activated to the same extent: the smallest pupils were referred to in \[lla\] by pronoun them and the activation'of this item was preserved <Rule (I)) ; the tall were mentioned by a definite NP in the topic, which keeps their activation. This indistinctness remains 112 E. HAJI~OV~, and S. VRBOV.~ unsolved even after the utterance of \[12a\], which adds another indistinct reference <their teachers - whose? of the smallest? of the tall?) and it is only after hearing \[12b\] that the hearer can decide: there the explicit mentioning of the teachers of the small children resolves the indistinct reference of their in \[12a\] and at the same time our factual knowledge of the world invites the inference that if the small children pretended not to be interested, their teachers would have nothing to be proud of, so that they in \[llb\] can refer only to the tall pupils.</Paragraph> <Paragraph position="11"> \[16\] The speaker of the direct speech is not mentioned in the quote (only by means of his reference to the Society of Friends of the School, of which he undoubtedly is a member), but the activation of this item does not seem to fade away. This appears to be a general feature of direct speech, which follows from the semantico-syntactic structure of the direct speech, where the &quot;direct speech clause&quot; functions as an object (Patient) of the introductory clause (and the speaker, as the Actor of the sentence, belongs to the TR of the same sentence).</Paragraph> <Paragraph position="12"> \[27\] The use of the courageous child to refer to the small pupil who screamed &quot;Hurrah!&quot; offers another example of the inferencing the hearer must perform in order to understand what is the sentence about and to determine the reference, even in cases where full NP's rather than pronouns are used.</Paragraph> <Paragraph position="13"> It is also easy to verify that with other variants of the text the weak pronoun they, if it were used after each of the sentences \[5\](e.g. in 6&quot;: The children observed them, ...) , \[6\], \[13\], \[18\], etc., would lack any referential indistinctness; similarly for he after \[14\] <where, in fact, the pronoun is deleted),\[17\], \[22\], etc. The referential indistinctness is present (and must be solved by means of inferencing) after such utterances as e.g. \[2b\], \[9\], \[IO\], \[lla\], \[12a\], where objects with the degrees @ and 1 can be referred to by the same pronoun. In a similar wav the indistinctness of the reference of nouns can be studied,cf, e.g. the use of father in \[19\] (synonymous with he, while in \[20\] the pronoun and in \[27\] the noun is necessary).</Paragraph> <Paragraph position="14"> 3. To conclude, it should be noted that when speaking about the hierarchy of salience (activation) and of degrees of activation of items of the stock of shared knowledge we do not have in mind an absolute scale or linear ordering of elements. The raise or drop of activation should be always taken as related to the other items of the stock. In the rules, we use two kinds of raise: a mild one <Rule <3)), and a strong one <Rule (2)); it may be the case that further empirical research will lead us to a more subtle classification.</Paragraph> <Paragraph position="15"> Also, we are aware that this is a point where linguistic and psychological investigations should go hand in hand. Anyhow, we are cnnvinced that such an empirical research, if guided by an attempt at a more explicit and systematic description, is of primary importance for (automatic) language comprehension, and as such also for all applicational projects of text understanding.</Paragraph> <Paragraph position="16"> should be born in mind that a &quot;given&quot; item may be in contrast, or irrecoverable (in the terms of Halliday), or not identifiable (Chafe), so that a definite noun may well occur inside the focus of a sentence.</Paragraph> </Section> <Section position="4" start_page="107" end_page="107" type="metho"> <SectionTitle> ON THE ROI.~ OF HIERARCHY OF ACTIVATION 113 3 The TR of a sentence has the form of a dependency tree, </SectionTitle> <Paragraph position="0"> labelled by complex symbols with the main predicate as its root; an ordering of the nodes is defined with respect to their so-called communicative dynamism, fulfilling the condition of projectivity; for a detailed description of TR's in the framework of the functional generative description, to which we subscribe, see Sgall (1967a); Haji~ov~ and Sgall (1980); and Panevov~ in this volume.</Paragraph> </Section> class="xml-element"></Paper>