File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/03/w03-0905_metho.xml
Size: 17,249 bytes
Last Modified: 2025-10-06 14:08:26
<?xml version="1.0" standalone="yes"?> <Paper uid="W03-0905"> <Title>The Genesis of a Script for Bankruptcy in Ontological Semantics</Title> <Section position="4" start_page="3" end_page="3" type="metho"> <SectionTitle> 2 Scripts in Ontological Semantics </SectionTitle> <Paragraph position="0"> In order to represent the meaning of connected text, not simply that of a sequence of ostensibly independent sentences, several things must happen. One of the most obvious connections across sentence boundaries is coreference. The TMR in ontological semantics allows for the specification of co-reference, and special procedures exist for treating at least facets of this phenomenon in extant applications of ontological semantics (see Nirenburg and Raskin, 2003, Section 8.6.1). Discourse relations among propositions can also hold across sentence boundaries, and ontological semantics includes facilities for both detecting and representing them as well (ibid, Section 8.6.3).</Paragraph> <Paragraph position="1"> There are, however, additional strong connections among elements of many texts. These have to do with the understanding that individual propositions may hold well-defined places in &quot;routine,&quot; &quot;typical&quot; sequences of events (often called complex events, scripts or scenarios) that happen in the world, with a well-specified set of object-like entities that appear in different roles throughout that sequence. For example, if the sequence of events describes a state visit, the &quot;actors&quot; may, under various circumstances, include the people who meet (the &quot;principals&quot;), their handlers, security personnel and journalists, possibly, a guard of honor; the &quot;props&quot; may include airplanes, airports, meeting spaces, documents, etc. All these actors and props will fill case roles and other properties in the typical component events of the standard event sequence for a state visit, such as travel, arrival, greetings, discussions, negotiations, press conferences, joint statements, etc. The component events are often optional; alternatively, some component events stand in a disjunctive relation with some others (that is, of several components only one may actually be realized in a particular instantiation of the overall complex event), and their relative temporal ordering may be fuzzy.</Paragraph> <Paragraph position="2"> Such typical scripts can be expressed in natural language using expository texts or narratives, sets of the above (indeed, one conceptual story can be &quot;gathered&quot; from several textual sources), plus text in tables, pictures, TV and movie captions, etc. The notion of script is clearly recursive, as every component event can itself be considered a script, at a different level of granularity. The notion of script, under a variety of monikers, was popularized in computer science by Minsky (1975), Schank and Abelson (1977), Charniak (1972), and their colleagues in the 1970s. However, at that time, no realistic-size implementation of natural language processing using scripts could be undertaken, in part, because there was no clear idea about the required inventory of knowledge sources, their relations and content. Script-based theories of semantics were proposed in theoretical linguistics (Fillmore 1985, Raskin 1986) but were overshadowed by the fashion for formal semantics, which is not descriptive in nature, focusing instead on the elements of semantics capturable by such logical devices as quantifiers (see Raskin 1994). Moreover, the size of the task of creating the ontological semantic knowledge sources, which are a sine qua non of script representation, was at the time underestimated by the practitioners and overestimated by critics. It can be said that ontological semantics is a descendant of the script-oriented approach to natural language processing, especially in the strategic sense of accentuating semantic content, that is, the quantity and quality of stored knowledge required for descriptions and applications. Ontological semantics certainly transcends the purview and the granularity levels of the older approach as well as offering an entirely different take on coverage of world and language knowledge and on its applicability. Ontological semantics has also the advantage of having been implemented, tested, and (constantly) improved in actual applications.</Paragraph> <Paragraph position="3"> In the script-based approach to processing text inputs, the scripts in the ontology that get instantiated from the text input provide expectations for processing further sentences in a text. Indeed, if a sentence in a text can be seen as instantiating a script in the nascent TMR, the analysis and disambiguation of subsequent sentences can be aided by the expectation that propositions contained in them are instantiations of event types that are listed as components of the activated script. Obviously, the task of activating the appropriate script from the input is far from straightforward. Also, not all sentences and clauses in the input text necessarily fit a given script--there can be deviations and fleeting extraneous meanings that must be recognized as such and connected to other elements of the TMR through regular discourse relations, that is, through a weaker connection than that among the elements of a complex event.</Paragraph> <Paragraph position="4"> Scripts usually describe situations with multiple agents. Each of these agents can be said, in some sense, to carry out their own plans that are made manifest through the reported component events in a script. Plans are special kinds of scripts that describe the process of attaining a goal by an agent or its proxies. Goals are represented in ontological semantics as postconditions (effects) of events (namely, steps in plans or components of general scripts). For example, if an agent's goal is to own a TV set, this goal would be attained on a successful completion of one of a number of possible plans.</Paragraph> <Paragraph position="5"> In other words, it will be listed in the ontology as the postcondition (effect) of such events as BUY, BORROW, LEASE, STEAL, MANUFACTURE. Note that the plans can be activated only if all the necessary preconditions for their triggering hold. Thus, the ontology, in the precondition property of BUY, for example, will list the requirement that the agent must have enough money (see McDonough 2000).</Paragraph> <Paragraph position="6"> Manipulating plans and goals is especially important in some applications of ontological semantics, for instance, in advice giving applications where the system is entrusted with recognizing the intentions (goals) of an agent or a group of agents based on processing texts about their behavior. Goal- and plan-directed processing relies on the results of the analysis of textual input, as recorded in the basic TMR, as well as the complementary knowledge about relevant (complex) events and objects and their instances, stored in the ontology and the Fact Database (see Nirenburg and Raskin, 2003, Section 7.2), and instantiated in the extended TMR. It is clear that reasoning based on the entire amount of knowledge in the extended TMR can be much richer than if only those facts mentioned in the input texts were used for inference making. Richer possibilities for reasoning would yield better results for any NLP application, provided it is supplied with the requisite inference making programs, for instance, for resolving translation mismatches. The reason we are making a distinction among NLP applications is the extent to which an application depends on such capabilities. For example, MT practitioners have typically assumed that this application does not really need machinery for inference making. This belief is clearly based on the perception that acquiring the knowledge necessary to support reasoning is prohibitively expensive or even outright infeasible, and therefore one must make do with simpler approaches. Of course, should MT developers be able to obtain such resources, they would use them. Ontological semantics has among its goals that of supplying application builders with exactly this kind of knowledge.</Paragraph> <Paragraph position="7"> A good if unusual example for a family of applications for which knowledge representation at the level of complex events is crucial but where its integration has so far been carefully avoided is computational humor (see, for instance, Raskin 1996; Stock and Strapparava 2002). Here, the analysis and generation of ambiguity--generally the key issue for semantics, and, accordingly NLP--is a key requirement for a text to be funny. Computational humor aims to increase the acceptability of natural language interaction between human and machine by injecting relevant humor into natural language interfaces. The most developed theory of humor--not only in the framework of script-based semantics--and the formalized model based on it (Attardo and Raskin 1991) have at their core the notion of incongruity conceptualized as two partially overlapping scripts in a relation of opposition. Earlier attempts at computational humor have simply hardwired two complex events in such a relation into templates (Raskin and Attardo 1994; Binsted and Ritchie 1997) instead of aiming at true generation based on an operational model of humor. And true humor generation is impossible without the identification of scripts in an appropriate relation of opposition, e.g., sexual vs. religious. On this basis, the overlap between the two scripts can be analyzed and generated, for example, in puns, where one lexical item from each of the two scripts are brought to overlap in one surface form through sound similarity (Hempelmann 2003). This is just an example--esoteric, as it may seem to some--of how the set of scriptrequiring NLP applications may expand with time and need.</Paragraph> <Paragraph position="8"> Obviously, as mentioned above, in addition to the knowledge, efficient reasoning procedures must be developed. Such procedures must conform to a number of constraints, an example of which is the following. It is common knowledge that, unless a limit is imposed on making inferences from knowledge units in rich knowledge bases, the inferencing process can go too far or even not halt at all. In advanced applications, for example, advice giving again, a good candidate for such a limit is deriving the active goals and plans of all relevant agents in the world. However, even applications that involve more or less direct treatment of basic text meaning, such as MT, will benefit from making fewer inferences. There will always be difficult cases, such as the need to understand the causal relation in The soldiers fired at the women and I saw some of them fall to select the correct reference for them--in Hebrew, for example, the choice of the pronoun (the masculine otam or the feminine otan will depend on the gender of the antecedent). Such cases are not overly widespread, and a prudent system would deliberately trigger the necessary inferences when it recognizes that there is a need for them. In general, any event is, in fact, complex, that is, one can almost always find subevents of an event; whether and to what extent it is necessary to represent it as a script is a matter of grain size dictated by whether an application needs this information for reasoning, and that, in turn, is largely determined by the nature of the corpora in the domain served by the application.</Paragraph> </Section> <Section position="5" start_page="3" end_page="3" type="metho"> <SectionTitle> 3 Format of Scripts </SectionTitle> <Paragraph position="0"> Scripts are represented in ontological semantics using the ontological property HAS-PARTS. It has temporal semantics if it appears in events, and spatial semantics if it appears in physical objects, e.g., to indicate that an automobile consists of an engine, wheels, the chassis, etc. The properties PRECONDITION and EFFECT also carry information necessary for various kinds of reasoning and apply to any event, complex or otherwise. Scripts require an extension to the specification format. The reason for that is the need to bind the case roles and other property values in component events to establish co-reference. Also, the HAS-PARTS slot of scripts should allow for the specification of rather advanced combinations of component events. Therefore, the format of the filler of HAS-PARTS in scripts should allow a) Boolean operators and, or and not, as well as IF, THEN/ELSE and b) loop statements. Scripts also need statements about partial temporal ordering of their components. For this purpose, a special new property, COMPONENT-RELATIONS is introduced.</Paragraph> <Paragraph position="1"> Component events in a script have a peculiar status.</Paragraph> <Paragraph position="2"> They are not regular instances of concepts, as no instantiation occurs in the ontology--instantiation is one of the two main operations in generating TMRs, the other being matching selectional restrictions in order to combine individual concept instances--but their meaning is different from that of the general concepts to which they are related. Thus, asking questions in the context of a class at school is clearly different from the general idea of asking questions. In order to represent this difference, the notion of ontological instance is introduced. In an ontological instance, some properties are constrained further as compared to their &quot;parent&quot; concept. The constraints typically take the form of cross-reference to the filler of another component event in the same script.</Paragraph> <Paragraph position="3"> For reasons of clarity and convenience, instead of describing the component events and component relations directly in the fillers of corresponding slots in the concept specification for the complex event, we use the device of reification by just naming them in a unique way in that location (we identify ontological instances by appending letters, not numbers as in the case of real instances) and describe their content separately, at the same level as the main script. As a result, the format of the ontological description of a script is a set of ontological concept frames.</Paragraph> <Paragraph position="4"> Reification in ontological semantics is a mechanism for allowing the definition of properties on properties by elevating properties from the status of slots in frames to the level of a free-standing concept frame. It is desirable from the point of view of nonproliferation of elements of metalanguage to avoid introducing a concept of, say DRIVER if it could always be referred to as DRIVE.AGENT. However, this brings about certain difficulties. For example, if we want to state that somebody is a driver of trucks, we would have to say that there is an instance of DRIVE in which the THEME is TRUCK and the AGENT is the person in question. There is no direct relationship between THEME and AGENT, and it would take a longer inference chain to realize that TRUCK is, in fact, the value of a property of DRIVER, too, not only of DRIVE. The more properties one would want to add to DRIVER and not to DRIVE, the more enticing it would be to reify the property DRIVE.AGENT and treat it as a separate concept. In principle, we can use reification on the fly, while building a TMR, when we need to add a property to a property, which is prohibited in the static knowledge sources such as the ontology and the lexicon. As we will see in the example below, reification also facilitates the specification of scripts.</Paragraph> <Paragraph position="5"> In the example below, we present a simplified view of the script/complex event TEACH. As illustrated, TEACH has as PRECONDITION two events--that the teacher knows the material and the students do not; as EFFECT, it has the event that the students (now) know the material. The process of teaching is presented as follows: the teacher presents the material to the students, the students ask the teacher questions about this material, and the teacher answers these questions. The above is admittedly a gross simplification of the actual state of affairs but will serve well for the purposes of illustration.</Paragraph> <Paragraph position="6"> The ontological instances introduced in the process are: TEACH-KNOW-A, -B AND -C, TEACH-DESCRIBE, TEACH-REQUEST-INFO, TEACH-ANSWER, TEACH-AFTER-A AND -B. The constraints in these instances are all refer null ences to fillers of slots in other components of the script or the script itself. Reference is expressed using the traditional dot notation (m.s[.f] is read as 'the filler of the [facet f of the] slot s of the frame m'). Ontological instances are not indexed in the Fact Repository. They appear in appropriate slots of scripts and their fillers are all references to fillers of other ontological instances within the same script or the script itself. They are PART-OF (inverse of HAS-PARTS) of the script in which they are listed but instance-of their corresponding basic concept, that is, TEACH-DESCRIBE-A is the first ontologi-</Paragraph> </Section> class="xml-element"></Paper>