File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/75/t75-2014_intro.xml
Size: 15,315 bytes
Last Modified: 2025-10-06 14:04:14
<?xml version="1.0" standalone="yes"?> <Paper uid="T75-2014"> <Title>GENERATION AS A SOCIAL ACTION</Title> <Section position="3" start_page="0" end_page="67" type="intro"> <SectionTitle> I. INTRODUCTION </SectionTitle> <Paragraph position="0"> This paper is about the &quot;why&quot; and the &quot;how&quot; of natural language generation.</Paragraph> <Paragraph position="1"> Specifically, why does a person choose to communicate one idea rather than another (or none at all), and how does this choice get translated into a particular utterance? I want to present a few of the major issues and then suggest some ways of viewing the problem.</Paragraph> <Paragraph position="2"> In the example above, the child's question is understandable in terms of his wants and fears and beliefs about the world. In order to explain why he asked the question he did, we must view his utterance as an action, rather than Just a string of words. A string of words, per se, is not associated with any plan or goal. But an action is; in fact, the full representation of an action seems to require a representation of both its actual and its intended effects, its actual and its assumed preconditions.</Paragraph> <Paragraph position="3"> This viewpoint further Justifies the study of what Searle, Austin and others have called the &quot;speech act.&quot; The production of a natural language utterance is best understood as an action which alters the state of the world, rather than as a mapping from a given meaning representation into a surface structure. Indeed, many of the problems of deep structure representation, such as focus and presupposition, are more profitably attacked in terms of action and plan structures.</Paragraph> <Paragraph position="4"> Generation can then be seen as a two stage process. First, a plan is formulated which requires a communication with others. Second, the communication is made in terms of conventions which allow complex intentions to be expressed easily. These stages are inextricab2y linked since the language conventions for expressing intentions make implic~\]t references to plans of both the speaker and hearer. At the same time knowledge of the conventions can be used in plan formation, II. GENERATION AS A SOCIAL ACTION Before discussing some of the conventions used to express intentions, it will be useful to consider the notion of a social action. A social action is one whose definition refers to beliefs, wants, fears, or intentions. There may be non-social actions. For example, eating can be described without reference to the diner's beliefs about the substance he puts in his mouth. But the description itself implies beliefs about the action which may or may not be shared. Another observer might say that the alleged diner is just pretending, or picking at the food,, or wolfing it down. A description of eating is a special case of INFORMING. It implies that the speaker believes that the diner is eating, and thus, that he believes that the diner intends to chew, swallow, and whatever else constitutes the physical act of eating. In other words, the act of describing encodes beliefs about plans. Speech is a social action because the definition of a speech act requires reference to the beliefs of both the speaker and the hearer. Implicit in each speech act is the goal of the speaker in making the utterance.</Paragraph> <Paragraph position="5"> There are actions other than speech acts for which the notion of belief and intention are necessary. For example, HELPING is a social action in which the actor does something which furthers a plan inferred for someone else. I believe that generation can best be understood as a social action, i.e., the problem is first to understand how to represent and process any action defined in terms of beliefs and intentions, then to further specify that understanding for language generation.</Paragraph> <Paragraph position="6"> III. HOW TO REPRESENT A SOCIAL ACTION The problem with a social action is that it cannot be represented by any predetermined, finite structure. Instead, its representation requires elements such as beliefs about plans, where plans are themselves defined in terms of beliefs and other plans. Rather than a structure itself, the definition of a social action is best thought of as a set of operations to be performed on a belief system. For example, a REQUEST requires the modification to a belief system such that (equivalently, is valid if) there is a plan of the speaker which has as either a goal or a subgoal a condition which the speaker believes would result from the action requested. Such an operation may require formulation of a plan or modification of existing plans.</Paragraph> <Paragraph position="7"> The recognition and representation of plans is a complex and by no means fully</Paragraph> <Paragraph position="9"> understood process. It requires a variety of types of knowledge to be applied. For instance, motivation rules must be used to determine whether a goal is appropriate for a person. Normative rules are used to account for behavior done under a sense of obligation. Wants are properties of persons which are used in inferring their goals in a given situation. These concepts and others are discussed in the references \[I-3\]. In the remainder of this paper I want to focus on one particular aspect of generation as a social action, namely, &quot;How is intention encoded?&quot; How does a speaker indicate the purpose of his utterance to his listeners? In the example in the beginning of this paper, the child is only partially successful in making his intention known, and to the extent that he fails, he also fails to achieve his goals. In this case at least one of his goals seems to be reassurance that even if he doesn't paint well he won't be rejected.</Paragraph> <Paragraph position="10"> Note that in recognizing his intention, Bruce's teacher also makes other inferences.</Paragraph> <Paragraph position="11"> He/she probably assumes that Bruce believes that he doesn't paint well; that he fears that he may be punished for painting &quot;mean pictures;&quot; and that he believes that the painter of the &quot;ugly picture&quot; stands in the same social relationship to the techer as does Bruce. These inferences are both consequences and determinants of the perceived intention.</Paragraph> <Paragraph position="12"> IV. HOW TO ENCODE INTENTION Presuppositions. What can a speaker do to ensure that his purpose, and consequently, its associated inferences, are communicated to his listeners? One way is to establish, in the discourse previous ~o the utterance, the presuppositions for the purpose. For example, the purpose of REQUESTING INFORMATION has the presupposition that the speaker does not know the information. In the kindergarten example, Bruce is making a different REQUEST, in this case, for reassurance.</Paragraph> <Paragraph position="13"> There is a different set of presuppositions which needs to be established. We can assume that the teacher's familiarity with children entering kindergarten makes it easier for him/her to establish such presuppositions as that Bruce fears that the teacher may do bad things to him. If the listener fails to establish the presuppositions, as Bruce's mother does, then the communication fails. Bruce would need to emphasize his fears to his mother in order to have his utterance understood.</Paragraph> <Paragraph position="14"> Linguistic Conventions. A second way that intentions are encoded is through use of linguistic conventions. For example, to indicate a REQUEST of any kind, the question form is typically used. A request often has a rising intonation, future tense or a modal, inverted word order, or a special word like &quot;please.&quot; Many intentions, such as REQUEST, have a special associated verb, e.g., &quot;I request that you...&quot; Discourse Structure. A third way to encode intentions is to take advantage of higher order linguistic conventions about discourse structure. There are places in a discourse where questions make sense, others where explanations are expected. Knowledge of typical discourse structures allows persons to condense and simplify utterances, avoiding the explicit establishment of presuppositions or explicit use of words like &quot;promise.&quot; While there is probably not a &quot;discourse grammar&quot; which would define &quot;well-formedness of discourses,&quot; it is useful to have a model of how social actions typically fit together, and thus a model of discourse structure. Such a model can be viewed as a heuristic which suggests likely action sequences. By focusing the search involved in recognizing intentions it facilitates generation and subsequent understanding.</Paragraph> <Paragraph position="15"> I have used the term &quot;social action paradigm&quot; (SAP) \[I\] for such a model of the flow of social actions. A SAP is a pattern of behavior (its body) with constraints (its header) on the applicability of the body.</Paragraph> <Paragraph position="16"> The header checks conditions on the situation in which the body is to be applied. At the same time, it binds variables in the SAP body to elements (people, times, locations, things) of the situation. A typical SAP body is shown in the attached figure. In the figure, REQUEST, SUGGEST, PROMISE, etc. are social actions; A and R are persons; and X is an action. F~(X) is an alternative to X; F~(X) is information which relates to the doing of X; and ~ (X) is a reason for not doing X.</Paragraph> <Paragraph position="17"> The SAP body says that A can REQUEST that R do X. Following the REQUEST, R may SUGGEST an alternative to X, may PROMISE to do X, may do X, may REFUSE to do X, or may REQUEST additional information. Following R's REFUSAL or inaction, A may DEMAND that R EXPLAIN, and so on.</Paragraph> <Paragraph position="18"> Both speakers in a discourse can be expected to know various SAP's. For instance, knowing that a SUGGESTION often follows a REQUEST it is not necessary to encode the SUGGESTION explicitly. A person does not have to say, &quot;I suggest instead that you...&quot; V. HOW TO FIT IT ALL TOGETHER The principal point of this paper is that generation needs to be understood as an action in a social context. Let us examine such a context to see how a person's plan is carried out by encoding his intentions.</Paragraph> <Paragraph position="19"> The context: Bill and Catherine are growing a vegetable garden. They have planted the seeds and have seen the first plants appear. The rhubarb is being attacked by small insects which have eaten holes in the leaves. Catherine notices the holes.</Paragraph> <Paragraph position="20"> A REQUEST R to X I I having rhubarb to eat is threatened by the insects. In this case let's assume that she formulates a plan to poison the insects with Bill's assistance. She assumes that Bill has the same goals as she does with regard to the garden. Furthermore, let's assume that she doesn't know what poison to use but believes that Bill does, and that Bill doesn't know about the holes. Thus Catherine's plan involves INFORMING Bill about the holes so that he will be motivated either to put poison on the rhubarb or to tell her what poison to use. Another appropriate action for .this plan is a REQUEST to Bill to do something about the holes (and the insects).</Paragraph> <Paragraph position="21"> Encoding Catherfne's INFORM/REQUEST: Catherine needs to do two things. One is to give information to Bill which she believes he does not have. This is called an INFORM. The other is to ask Bill to do something on the basis of his new knowledge. This is called a REQUEST. She can do these things with two utterances. However, if she believes that Bill doesn't want the holes, that he will infer that either she needs to know what poison to use or that he must apply the poison himself, and that he believes that she believes these things, then one utterance may be sufficient. Thus Catherine may just say, &quot;Bill, the rhubarb's got holes.&quot; In that case she is relying on shared presuppositions about her utterance to carry the information about intention. On the other hand she could use explicit linguistic conventions as in, &quot;I inform you, Bill, that the rhubarb's got holes. I request that you either tell me which poison to use or apply poison to it yourself.&quot; Bill's plan: While Bill has basically the same goals as Catherine let's assume he doesn't know much about plants, particularly rhubarb. When Catherine tells him that the rhubarb has holes he fails to make the inference that insects are eating the plant. Without that inference her last utterance might appear as an INFORM but not a REQUEST. Thus he has no reason to modify his plans about the garden. However he might well wonder why she said such a thing and formulate a plan to satisfy his curiosity.</Paragraph> <Paragraph position="22"> An action for his plan could be to REQUEST Catherine to explain her last utterance.</Paragraph> <Paragraph position="23"> Encoding Bill's REQUEST: Following an INFORM a common action is a REQUEST to the first speaker to EXPLAIN his INFORM. This fact is expressed in the SAP's which include INFORM's. The general expectation of such a REQUEST coupled with a commonly used lingustic convention makes it possible for Bill to express his REQUEST succinctly: &quot;So?&quot; Catherine's plan: Realizying that Bill misses the point of her INFORM/REQUEST Catherine also realizes that her plan needs further action. She has to infer from Bill's REQUEST that he is not making the appropriate inferences himself and needs to be told directly that there are insects on the rhubarb which need to be poisoned.</Paragraph> <Paragraph position="24"> INFORM/REQUEST: Catherine still believes that Bill will recognize her implicit REQUEST and that the problem with her first utterance was that facts were left out which Bill needed. Thus she says, &quot;It's covered with little bugs!&quot; VI. CONCLUSION The little dialogue introduced above could be continued in any of several directions. One likely continuation might be: &quot;I guess we oughta dust it then.&quot; &quot;I don't know what to use.&quot; &quot;How about the rose bush powder?&quot; &quot;On rhubarb?&quot; &quot;Sure, a bug's a bug.&quot; &quot;OK. But you do it. I don't know how much to use.&quot; For each utterance in this sequence there is an associated plan and set of beliefs. At the same time there is heavy use of presuppositions, linguistic conventions, and SAP's to improve the speed and ease of communication.</Paragraph> <Paragraph position="25"> These comments provide only a partial answer to the question of why Catherine Says, &quot;The rhubarb's got holes,&quot; or why Bruce says, &quot;who made these ugly pictures?&quot;, They give but a sketch of how words are selected to encode intentions. I hope, though, that the comments have supported a consideration of generation as a social action occurring in the context of the speaker's and listener's intentions and beliefs.</Paragraph> </Section> class="xml-element"></Paper>