File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/91/w91-0210_metho.xml
Size: 23,163 bytes
Last Modified: 2025-10-06 14:12:48
<?xml version="1.0" standalone="yes"?> <Paper uid="W91-0210"> <Title>Lexical Structures for Linguistic Inference</Title> <Section position="4" start_page="102" end_page="104" type="metho"> <SectionTitle> 2 Generative Lexicon </SectionTitle> <Paragraph position="0"> Pustejovsky's Generative Lexicon theory \[Pustejovsky, 1991\] rejects the characterization of a lexicon as a static listing of word senses and focuses instead on elucidating the set of generative devices which give rise to word senses and allow for the flexible and creative use of language.</Paragraph> <Paragraph position="1"> Generative Lexicon theory postulates a number of representational subsystems for the description of lexical items and the rules of syntactic and semantic composition. These include, in addition to argument and event structures, (1) a qualia structure, which partitions the aspects of a noun's meaning into formal, constitutive, agentive, and telic roles, or qualia, (2) a lexical inheritance network reflecting the same partition, (3) Lexical Conceptual Paradigms, which describe sets of syntactic behaviors which correspond to lexicai semantic categories, and (4) generative devices for extending the logical senses of lexical items dynamically, as for instance coercion rules, mentioned later in this paper. By attempting to tease apart various aspects of the lexical semantics, the theory provides a new vocabulary with which to approach traditional linguistic problems.</Paragraph> <Paragraph position="2"> To illustrate the use of qualia structures for handling selectional restrictions, consider three schematic qualia structures for food-related nouns: food(x) \[const: phys_obj(x)\] \[formal: \] \[relic: eat(E,z,x)\] \[agentive: cook(w,x), prepare(y,x)\] meal(T,x) \[const: food(y)\] \[formal: amount(x,y)\] \[relic: eat(T,z,x)\] \[agentive: &quot;food\]</Paragraph> <Paragraph position="4"> The empty formal role in the structure for &quot;food&quot; indicates that food is a mass noun and, as the constitutive role indicates, composed of some physical material. The telic role is doing most of the work of defining food as something which is eaten. The agentive role adds that food typically arises by cooking or preparing processes (although this is certainly defeasible).</Paragraph> <Paragraph position="5"> Doughnut also inherits the telic quale from food, but differs from food in that it specifies a value in its formal quale. That is, it &quot;packages&quot; the mass noun in a specific way (cf. Pelletier and Schubert \[1986\] , Krifka \[1987\] ). This in fact, is the structural semantic characterization of a packaging coercion: taking a lexical structure with no formal role, and specifying the formal role (Pustejovsky \[1991\] for details).</Paragraph> <Paragraph position="6"> Now consider tile representation for meal. A meal directly denotes both the event pertaining to thc activity of eating the food, as well as tile food itself. The fact that meal is defined with respect to food is made evident in the fact that the formal role is a certain packaging of an amount of food, while the telic role includes the event denoting the occasion of the eating. Now consider the following sentences making use of these nouns.</Paragraph> <Paragraph position="7"> 1. John ate the doughnut.</Paragraph> <Paragraph position="8"> 2. John ate the meal.</Paragraph> <Paragraph position="9"> 3. John left after the meal.</Paragraph> <Paragraph position="10"> 4. John left after tile doughnut.</Paragraph> <Paragraph position="11"> Tile first problem presented by such data is how to describe the seleetional restriction imposed by the verb eat on its direct object, since it clearly allows for both physical objects and events. The qualia structure suggests a straightforward answer. Instead of stating the restriction in terms of a formal or constitutive property, we may use the telic role. That is, eat may take as direct object any noun whose telic role is eat. This may at first seem like a circular definition, but why should the telic role be any less accessible to the selecting verb than, say, the formal role? This approach also renders it unnecessary to create an explicit subclass of physical objects called edible-objects. The class is implicitly defined via the telic quale.</Paragraph> <Paragraph position="12"> The next problem is how to accomodate doughnut as the object of a preposition selecting for an event. This sentence requires an inference (i.e. that John left after eating the doughnut) that can only be verified in the discourse context. However, the lexical structures do provide a means for quickly identifying the potential interpretation.</Paragraph> <Paragraph position="13"> Pustejovsky \[1989, 1991\], has proposed a system of type coercion to explain how verbs like want and believe can accept arguments of type S, VP, and NP without requiring multiple verb entries in the lexicon. Ite defines type coercion as &quot;a semantic operation that converts an argument to the type which is expected by a function, where it would otherwise lead to a type error.&quot; This principle of type coercion can be applied to coercing an object into an event, as required for our doughnut. The telic and agentive roles identify readily available, definitional events associated with a nominal. In most of the examples of this kind of selectional restriction violation that we have encountered, it is the telic role that fills the expected event.</Paragraph> <Paragraph position="14"> By examing what systematic semantic relationships exist in the grammatical expressions of lexical items, we hope to apply the principles of the generative lexicon to new data and illustrate new phenomena in support of the view of the &quot;non-static&quot; lexicon. Making use of data from human interest &quot;notes&quot; conferences, Wall Street Journal and Time magazine articles, our methodology is to analyze a set of semantically related verbs with respect to their argument types in order to ascertain how the lexical semnantic structure of the verb relates to the range of potential (e.g. metonymic) concepts that may appear in an argument position. 2 We consider first a number of verbs in the domain of music, such as &quot;play&quot;, &quot;perform&quot;, and &quot;strum.&quot; Then we turn to verbs of reporting, as found in newspaper and magazine articles.</Paragraph> </Section> <Section position="5" start_page="104" end_page="106" type="metho"> <SectionTitle> 3 Hierarchical classification and qualia structure </SectionTitle> <Paragraph position="0"> One question is how to assign tile definitional information to the respective qualia. While it is beyond the scope of this paper to adequately address this question, let us suggest that the very structure of the definitions of words often suggests which qualia are of relevance. Consider for example a faceted classification scheme (cf. Vickery \[1975\] ), as one might do for a subject matter indexing task for the music domain (the data presented here is based on an electronic &quot;notes&quot; conference on music): music production creation &quot;compose&quot; performance production means voice &quot;sing&quot; lips &quot;whistle&quot; musical instrument &quot;play&quot; in a particular manner brushing across strings &quot;strum.&quot; sounding strings with a pick &quot;pick&quot; 2We have made extensive use of Levin's \[1990\] verb classification, and the methodology of verbal diathesis therein. purpose entertaining an audience &quot;perform&quot; skill improvement &quot;practice&quot; The category &quot;production means&quot; describes, for example, verbs that incorporate different &quot;tools&quot; for making the music, i.e. sing incorporates in its lexical meaning the use of voice, whistle the use of lips and play the use of an instrument. Note that play does not specify the instrument, it implies an instrument but can take as an argument a specification of the particular instrument. Neither sing nor whistle can take an instrument as argument. The implied argument pertains to the constitutive role.</Paragraph> <Paragraph position="1"> Perform and practice, both purposeful activities by definition, define that purpose in their relic role. Both words select for an event in object position. An interesting difference between these two words is the selectionai restriction on the event type for the object position: perform selects for the result of an accomplishment, practice for an activity. This fact explains a difference in metonymic behavior that we will sketch below.</Paragraph> <Section position="1" start_page="105" end_page="106" type="sub_section"> <SectionTitle> 3.1 Possible violations in object position </SectionTitle> <Paragraph position="0"> We can arrive in a similar mnanner at a categorization of nouns in object position of verbs from our musical domain: Music &quot;packagings&quot; unit tune, song, melody, piece, chord, lick, note, repertoire, arrangement nmsical form jig, horT~pipe, symphony, sonata relation to other performers lead, backup location in a piece ending, intro number of perfomers solo, duet physical presentation line, measure, bar, page musical instruments guitar, piano musical instrument parts string, key, soundboard, bridge genres jazz, blues events containing music gig, concert, job, festival, wedding locations where music is played bar, club, concert hall composers Bach, Beethoven instrument makers Fender, Martin This faceted classification together with data analysis suggests that &quot;music packagings&quot; tune, song, ... are related to packagers \[Pelletier and Schubert 1989\], in fact &quot;packaging&quot; the mass term music into events. This category can be further refined into &quot;accomplishments&quot; (sonata, piece, ...) and &quot;achievements&quot; (chord, note, ... ). This corresponds with the behavior of perform and practice: to practice a chord to practice a high note to perform a sonata to perform a piece This discussion gives only a very rough first impression on the workings of the structures of the Generative Lexicon theory when it comes to violations of seleetional restrictions. We showed, however, that it is not impossible to characterize the common sense knowledge that pertains to linguistic behavior. This is an important goal for a generative lexicon, as it aims to capture exactly that body of knowledge that bears on the linguistic behavior of words (in context) in a structured way, thus constraining common sense inferencing at an appropriate level of granularity.</Paragraph> </Section> </Section> <Section position="6" start_page="106" end_page="109" type="metho"> <SectionTitle> 4 Lexical constraints on common sense reasoning </SectionTitle> <Paragraph position="0"> In the previous section we have shown how to identify and categorize the conceptual or world knowledge that directly bears on the syntactic behavior of verbs towards their direct objects in the musical domain. Let us now consider another semantic field, namely reporting verbs in the newspaper context. These verbs frequently occur with a metonymy in subject position. We show how the violations of the selectional restrictions can be dealt with in the Generative Lexicon framework, producing at the same time a pragmatic coherence constraint for the complement clause.</Paragraph> <Paragraph position="1"> Most utterance verbs can be used in reported speech in newspaper articles, selecting for two arguments, namely a proposition as complement clause and a source as subject. The interpretation of these instances of reported speech is that the journalist quotes or reports the proposition in the complement clause giving as evidence for its credibility or reliability the original source of the reported material. (cf. Bergler, forthcoming) Two examples of reported speech in newspaper text.s are: A Lincoln spokesman said its management &quot;never authorized or participated in any bugging of verb as a reporting verb is sometimes slightly different from its literal sense. We therefore introduce a semantic type REPORTING-VERB, which allows us to define a small part of the lexical hierarchy of reporting verbs as follows: This partial hierarchy is a very crude classification of the four reporting verbs say, announce, claim, and insist. We have indicated a literal word sense where it differs significantly, but have made no attempt to define the respective lexical entries completely. Note that the semantic type REPORTING--VERB specifies the basic &quot;reporting situation&quot;, especially that the telic role (roughly describable as its &quot;purpose&quot;) is that the source has expertise on the topic of the complement clause or at least some (relevant) insight.</Paragraph> <Paragraph position="2"> As reported in Bergler \[1991\] , we find that reporting verbs frequently occur with different kinds of metonymy. The violated selectional constraint is that the source of the utterance, i.e. the agent, is not a single individual human being, but a city or institution or even a building, as illustrated below: 1. Marlin Fitzwater said ...</Paragraph> <Paragraph position="3"> 2. Washington said ...</Paragraph> <Paragraph position="4"> 3. The White Ilouse said ...</Paragraph> <Paragraph position="5"> 4. IBM announced ...</Paragraph> <Paragraph position="6"> All reporting verbs accept metonymic extensions where the original human source is replaced by some significant superclass they fall into. In the case of reporting verbs of special importance is the employer (IBM) or the membership in a group (the French). But we find also the characterization of the source in terms of a role he or she played , as in the witness. This &quot;knowledge&quot; can best be stored in the concept of REPORTING-VERB -- all the verbs defined with a reporting verb sense will then inherit the &quot;permission&quot; to use metonymic extensions to a superclass or a specific role played in an event.</Paragraph> <Paragraph position="7"> But this is not a sufficient restriction on the possible metonymies. Consider the following scenario: a German makes a disparaging remark C/ about tim German government in the United States. The types of expected descriptions of the source, that would be deemed appropriate, might be the following: Hans Glueck said that C/ (Name) A German said that C/ (Nationality) ? A Mercedes Benz employee said that C/ (Affiliation) however, given what C/ denotes the following descriptions would be deemed inapproprite and in fact uninformative (of. Grice \[1967\], Gazdar \[1979\] ) A human said that C/ A vegetarian said that C/ A European said that C/ This example illustrates that not all true descriptions of a source can be used as subject in reporting contexts. The source has to be relevant to the topic of the complement clause. Bergler \[forthcoming\] describes that the lexical realization of the source in newspaper articles generally adds to the evaluation of the credibility or reliability the source has with respect to the topic. Thus a German in general has more competence and thus reliability on issues concerning the German government because of direct affectedness; this fact however is lost if we characterize him (correctly) as a European.</Paragraph> <Paragraph position="8"> This is little surprising; general pragmatic considerations such as Gricean maxims (&quot;Be relevant&quot;, &quot;Be informative&quot;) account for this fact. It has been noticed that it is extremely difficult to put these pragmatic considerations into computational form \[Wilks, 1986\] . We will show here that some pragmatic coherence can, however, be established on the basis of lexical semantics. When analyzing the subjects for all occurrences of seven reporting verbs in a corpus of 250,000 words from the TIME Magazine (1963), we found that a small semantic grammar described the different lexicalizations of the subject, moreover, that grammar shows that the source is often lexcalized as some institution or as holding a particular position. It is beyond the scope of this paper to discuss why this should be so; suffice it to allude to the fact that newspapers and magazines report very frequently on government activity and business developments and that iusider information is considered more valuable.</Paragraph> <Paragraph position="9"> The dilemma here is that a basically pragmatic problem is part of the selectional restrictions of reporting verbs and that therefore the occurring systematicity has to have some reflection in a lexical semantics. What does the Generative Lexicon solution to this problem look like? Let us first state the problem with an example. Announce selects for a subject that has some &quot;legitimization&quot; for making the statement that is repeated or paraphrased in the complement. The relation between subject and complement that constitutes the &quot;legitimization&quot; is established by the projective conclusion space, a device that generates certain semantic relationships between lexical items dynamically, much in the way a semantic net records semantic relationships between words statically. 3 The iexical definition of officiafis in the American lleritage Dictionary is: official adj I. Of, pertaining to, or authorized by a proper authority; authoritative.</Paragraph> <Paragraph position="10"> 2. Formal or ceremonious: an official banquet.</Paragraph> <Paragraph position="11"> n I. One who holds an office or position.</Paragraph> <Paragraph position="12"> 2. A referee in a sport.</Paragraph> <Paragraph position="13"> and in the Longman Dictionary of Contemporary English we find: offlc|al adj of or about a position of trust, power, and responsability: an official position/an official occasion/an official manner o\] speaking. -- opposite: unofficial; compare officious which results in a GL entry of the fornl 4 otfieial(*x*, y) \[Form: human(*x*), organization(y), position(.') ~ in(z,y) has-authority(z) ~i hold(*x*, :)\] aFor detail, see \[Pustejovsky 1991\] \[Telie: work-forC*x*, y), trust(y, *x*)\] \[Constitutive: individual(*x*)\] \[hgentive: ~ y\] The combination Officials announced sounds very appropriate (semantically &quot;comfortable&quot;) regardless of the content of the complement clause, because the semantic selectional restriction imposed by announce is met at the lexicai semantic level by the definition of official as somebody of authority or trust, implicitly &quot;legitimized&quot;. This is why Officials announced sounds more like a phrasal pattern than like an ad hoc generative combination.</Paragraph> <Paragraph position="14"> This is an obvious case where a simple feature matching procedure can establish the coherence relation given two lexical entries. Consider the slightly more complex case of Chrysler officials announced (b, which introduces constraints on the content of the complement, restricting it to matters related to the automobile industry in general. Chrysler officials is a particularization of officials in general and is thus more specific. A more specific source has a more specific domain of expertise (and is thus often more reliable). This means that the lexical coherence between Chrysler officials and announce is established just like above with the additional constraint for the complement to be coherent (i.e. to relate to something about) Chrysler.</Paragraph> <Paragraph position="15"> Lastly, the metonymic use of Chrysler announced ~ causes a violation of the selectional restriction of announce, which leads to a coercion of Chrysler to an equivalent of Chrysler officials s. The interpretation process of the three subject NP phrases Officials, Chrysler officials, and Chrysler in the context of announce is entirely due to the lcxical semantics of the words involved, and has thus far not involved random common sense inference, which is one of the main ideas behind GL (namely to minimize common sense reasoning to the really hard cases).</Paragraph> <Paragraph position="16"> Let us now make the leap to say, a reporting verb that has no explicit semantic constraints. We know that anybody can say anything without in fact being of much consequence. The use of say as a reporting verb in the newspaper context, however, restricts its use to statements where the source has some insight or experience that justifies repeating or rephrasing his/her words. This is a fact related to the concept of REPORTING-VERB, in fact an anchor for coercion.</Paragraph> <Paragraph position="17"> The case of Officials said ~ is analog to the case of Officials announced ~b, with officials being in a position of authority or trust filling the requirement associated with the concept REPORTING-VERB, an inherited and therefore weaker constraint than that for announce. Similarly, Chrysler officials said that fb derives the coherence relation from the concept REPORTING-VERB and constrains the topic of the complement clause to coher with Chrysler.</Paragraph> <Paragraph position="18"> Chrysler said (b is more marked than Chrysler announced qS, but indeed does occur frequently, for example in the Wall Street Journal corpus. Interestingly, we found that in the sentences that contained a company name as subject of say, the complement phrase was frequently referring back to the subject with the pronoun it as in: This enhances the markedness of the sentences enforcing the same coercion based on a stipulation of metonymy as for announce. But note that it is possible to use say in its reporting sense with a much weaker coherence relation between the source and the complement than the one required for announce.</Paragraph> <Paragraph position="19"> We have derived coherence relations between three different source NPs, Officials, Chrysler oJ~icials, and Chrysler to two reporting verbs, namely announce and say from the lexical semantics of the words involved and we have shown how the compositional behavior of the matrix clause in turn introduces topic constraints on the complement. This means that an issue as inherently pragmatic sounding as restricting tile metonymy in subject position of reporting verbs to extensions that bear relevance to the complement clause can partly be resolved on the basis of lexical semantics, when treating the lexicon not as a static, passive list of definitions but a dynamic structure that can incorporate procedures that were previously only found in common sense reasoning systems. The important advantage to pure common sense reasoning systems lies in the fact that the reasoning in GL is constrained by and limited to syntactic and lexical semantic knowledge, incorporating conceptual knowledge only where it bears on language behavior. This results in much cheaper inference mechanisms, but will not explain the truly creative use of language.</Paragraph> </Section> class="xml-element"></Paper>