XML Viewer - p92-1027

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/92/p92-1027_metho.xml
Size: 23,882 bytes
Last Modified: 2025-10-06 14:13:12
<?xml version="1.0" standalone="yes"?>
<Paper uid="P92-1027">
  <Title>A UNIFICATION-BASED SEMANTIC INTERPRETATION FOR COORDINATE CONSTRUCTS</Title>
  <Section position="1" start_page="0" end_page="0" type="metho">
    <SectionTitle>
A UNIFICATION-BASED SEMANTIC INTERPRETATION
FOR COORDINATE CONSTRUCTS
</SectionTitle>
    <Paragraph position="0"> Internet: park@line, cis. upenn, edu</Paragraph>
  </Section>
  <Section position="2" start_page="0" end_page="0" type="metho">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> This paper shows that a first-order unification-based semantic interpretation for various coordinate constructs is possible without an explicit use of lambda expressions if we slightly modify the standard Montagovian semantics of coordination.</Paragraph>
    <Paragraph position="1"> This modification, along with partial execution, completely eliminates the lambda reduction steps during semantic interpretation.</Paragraph>
  </Section>
  <Section position="3" start_page="0" end_page="210" type="metho">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> Combinatory Categorial Grammar (CCG) has been offered as a theory of coordination in natural language (Steedman \[1990\]). It has usually been implemented in languages based on first order unification. Moore \[1989\] however has pointed out that coordination presents problems for first-order unification-based semantic interpretation.</Paragraph>
    <Paragraph position="1"> We show that it is possible to get over the problem by compiling the lambda reduction steps that are associated with coordination in the lexicon. We show how our first-order unification handles the following examples of coordinate constructs.</Paragraph>
    <Paragraph position="2">  (1.1) Harry walks and every farmer walks.</Paragraph>
    <Paragraph position="3"> (1.2) A farmer walks and talks.</Paragraph>
    <Paragraph position="4"> (1.3) A farmer and every senator talk.</Paragraph>
    <Paragraph position="5"> (1.4) Harry finds and a woman cooks a mushroom.</Paragraph>
    <Paragraph position="6"> (1.5) Mary gives every dog a bone and some  policeman a flower.</Paragraph>
    <Paragraph position="7"> We will first start with an illustration of why standard Montagovian semantics of coordination cannot be immediately rendered into a first-order  unification strategy. The lexicon must contain multiple entries for the single lexical item &amp;quot;and&amp;quot;, since only like categories are supposed to conjoin. For example, the lexical entry for &amp;quot;and&amp;quot; in (1.1) specifies the constraint that the lexical item should expect on both sides sentences to give a sentence. Moore \[1989\] predicts that a unification-based semantic interpretation for sentences which involve for example noun phrase coordination won't be possible without an explicit use of lambda expressions, though there are cases where some lambda expressions can be eliminated by directly assigning values to variables embedded in a logical-form expression. The problematic example is shown in (1.6), where proper noun subjects are conjoined.</Paragraph>
    <Paragraph position="8"> (1.6) john and bill walk.</Paragraph>
    <Paragraph position="9"> The argument is that if we do not change the semantics of &amp;quot;john&amp;quot; from j to AP.P(j), where P is a second order variable for property in the Montagovian sense 1 , then the single predicate AX. walk(X) should accommodate two different constants j and b in a single variable X at the same time. Since the unification simply blocks in this case, the argument goes, we need to use higher order lambda expressions such as AP.P(j) or AP.P(b), which when conjoined together, will yield semantics for e.g. &amp;quot;john and bill&amp;quot; as</Paragraph>
    <Paragraph position="11"> Combined finally with the predicate, this will result in the semantics (1.7), after lambda reduction.</Paragraph>
    <Paragraph position="12"> (1.7) walk(j) &amp; walk(b) 1Montague \[1974\]. )~pVp(j) to be exact, taking intensionality into account. The semantics of the predicate &amp;quot;walks&amp;quot; will then be (^AX.walk(X)). Although Moore did not use quantified noun phrases to illustrate the point, his observation generalizes straightforwardly to the sentence (1.3). In this case, the semantics of &amp;quot;and&amp;quot;, &amp;quot;every&amp;quot; and &amp;quot;some&amp;quot; (or &amp;quot;a&amp;quot;) will be (1.8) a, b, and c, respectively. null  (1.8) (a) AO.AR.AP.(Q(P) * R(P)) (b) AS. AP'. forall(X, S (X) =&gt;P' (X)) (c) AS.AP&amp;quot;. exists(X,S(X)~P' ' (X))  Thus, after four lambda reduction steps, one for each of Q, R, P' and P' ', the semantics of &amp;quot;a farmer and every senator&amp;quot; will be AP.(exists(X,faxmer(X)RP(X)) forall(X,senator(X)=&gt;P(X))), as desired.</Paragraph>
    <Paragraph position="13"> Moore's paper showed how lambda reduction could be avoided by performing lambda reduction steps at compile time, by utilizing the lexicon, instead of doing them at run time. Consider again (1.8a). The reason why this formulation requires foursubsequent lambda reduction steps, not three, is that the property P should be applied to each of the conjuncts, requiring two separate lambda reduction steps. Suppose that we try to eliminate these two lambda reduction steps at compile time by making the argument of the property P explicit in the lexicon, following the semantics (1.9).</Paragraph>
    <Paragraph position="15"> The first-order variable X ranges over the set of individuals, and the hope is that after lambda reduction it will be bound by the quantifiers, such as forall, embedded in the expressions denoted by the variables Q and R. Since the same variable is used for both constructs, however, (1.9) works only for pairs of quantified noun phrases, which don't provide constants, but not for pairs involving proper nouns, which do provide constants. Incidentally, this problem is particular to a unification approach, and there is nothing wrong with the semantics (1.9), which is equivalent to (1.8a). This unification problem cannot be avoided by having two distinct variables Y and Z as in (1.10) either, since there is only one source for the predicate property for the coordinate noun phrases, thus there is no way to isolate the argument of the predicate and assign distinct variables for it at compile time.</Paragraph>
    <Paragraph position="17"> The way we propose to eliminate the gap between (1.9) and (1.10) is to introduce some spurious binding which can always be removed subsequently. The suggestion then is to use (1.11) for the semantics of &amp;quot;and&amp;quot; for noun phrase conjunction. null</Paragraph>
    <Paragraph position="19"> This satisfies, we believe, the two requirements, one that the predicate have the same form, the other that the variables for each conjunct be kept distinct, at the same time. The rest of the lambda expressions can be eliminated by using the notion of partial execution (Pereira &amp; Shieber \[1987\]).</Paragraph>
    <Paragraph position="20"> Details will be shown in Section 3, along with some &amp;quot;more immediate but faulty&amp;quot; solutions. It is surprising that the same idea can be applied to some fairly complicated examples as (1.5), and we believe that the solution proposed is quite general.</Paragraph>
    <Paragraph position="21"> In order to show how the idea works, we use a first-order Montagovian Intensional Logic (Jowsey \[1987\]; Jowsey \[1990\]) for a semantics. We apply the proposal to CCG, but it could equally well be applied to any lexicon based grammar formalism. We explain briefly how a CCG works in the first part of Section 2. As for the semantics, nothing hinges on a particular choice, and in fact the code we show is devoid of some crucial features of Jowsey's semantics, such as indices for situations or sortal constraints for variable binding.</Paragraph>
    <Paragraph position="22"> We present the version of Jowsey's semantics that we adopt for our purposes in the second part of Section 2, mainly for completeness. In Section 3, each of the cases in (1.1) through (1.5), or variations thereof, is accounted for by encoding lexical entries of &amp;quot;and&amp;quot;, although only (1.3) and (1.5) depend crucially on the technique.</Paragraph>
    <Paragraph position="23"> We have a few words for the organization of a semantic interpretation system we are assuming in this paper. We imagine that it consists of two levels, where the second level takes a scope-neutral logical form to produce every possible, genuinely ambiguous, scoping possibilities in parallel and the first level produces this scope-neutral logical form from the source sentence. We assume that our second level, which we leave for future research, will not be very different from the one in Hobbs &amp; Shieber \[1987\] or Pereira &amp;: Shieber \[1987\]. The goal of this paper is to show how the scope-neutral logical forms are derived from natural language sentences with co-ordinate constructs. Our &amp;quot;scope-neutral&amp;quot; logical form, which we call &amp;quot;canonical&amp;quot; logical form (CLF), syntactically reflects derivation-dependent order of quantifiers since they are derived by a derivation-dependent sequence of combination. We emphasize that this derivation-dependence is an artifact of our illustrative example, and that it is not an inherent consequence of our technique.</Paragraph>
  </Section>
  <Section position="4" start_page="210" end_page="211" type="metho">
    <SectionTitle>
2 Background Formalisms
A Combinatory Categorial Grammar
</SectionTitle>
    <Paragraph position="0"> The minimal version of CCG we need to process our examples contains four reduction rules, (2.1) through (2.4), and two type raising rules, (2.5) and (2.6), along with a lexicon where each lexical item is assigned one or more categories. For the reasons why we need these, the reader is referred  to Steedman \[1990\].</Paragraph>
    <Paragraph position="1"> (2.1) Function Application (&gt;): X/Y Y= =&gt; X (2.2) Function Application (&lt;): Y X\Y =&gt; X (2.3) Function Composition (&gt;B): X/Y Y/Z =&gt; X/Z 2 (2.4) Function Composition (&lt;B): Y\Z X\Y =&gt; XXZ (2.5) Type Raising, Subject (&gt;T): np =&gt; s/(sknp) (2.6) Type Raising, Backward (&lt;T): np =&gt; X\(X/np)  The present fragment is restricted to the basic categories n, np and s. 3 Derived categories, or categories, are recursively defined to be basic categories combined by directional symbols (/or \). Given a category X/Y or X\Y, we call X the range category and Y the domain category. Parentheses may be used to change the left-associative default. The semantics part to be explained shortly, (2.7a) through (2.7e) show examples of a common noun, a proper noun, a quantifier, an intransitive verb, a sentential conjunction, respectively.</Paragraph>
    <Paragraph position="2">  (2.7) Sample Lexicon (a) cat(farmer, n:X'farmer(X)).</Paragraph>
    <Paragraph position="3"> (b) cat(harry, np:AI'(h'B)'B).</Paragraph>
    <Paragraph position="4"> (c) cat(every, np: (X'A)'(X'B)'forall(X,A=&gt;B) /n:X'A).</Paragraph>
    <Paragraph position="5"> 2In Steedman \[1990\], this rule is conditioned by Z s\np in order to prevent such constructs as &amp;quot;*\[Harry\] but \[I doubt whether Fred\] went home&amp;quot; or &amp;quot;*\[I think that Fred\] and \[Harry\] went home.&amp;quot;  (d) cat (walks, s : S\np: (X'A)&amp;quot; (X'walk(X)) &amp;quot;S). (e) cat(and, (s: (St ~ S2)\s:S1)/s:S2),4 A First-Order Montague Semantics In this section, we will focus on describing how Jowsey has arrived at the first-order formalism that we adopt for our purposes, and for further details, the reader is referred to Jowsey \[1987\] and Jowsey \[1990\]. The reader can safely skip this section on a first reading since the semantics we use for presentation in Section 3 lacks many of the new features in this section.</Paragraph>
    <Paragraph position="6"> Montague's PTQ analysis (Dowty, Wall &amp; Peters \[1981\]) defines an intensional logic with the basic types e, t and s, where e is the type of entities, t the type of truth values and s the type of indices. Derived types &lt;a,b&gt; and &lt;s,a&gt; are recursively defined over the basic types. A name, which is of type e, denotes an individual; individual concepts are names relativized over indices, or functions from indices to the set of individuals. Individual concepts are of type &lt;s, e&gt;. A predicate denotes a set of individuals, or a (characteristic) function from the set of individuals to truth values. Properties are intensional predicates, or functions from indices to the characteristic functions. Properties are of type &lt;s,&lt;e,t&gt;&gt;, or &lt;e,&lt;s,t&gt;&gt;.</Paragraph>
    <Paragraph position="7"> A formula denotes a truth value, and propositions are intensional formulas, thus of type &lt;s,t&gt;.</Paragraph>
    <Paragraph position="8"> By excluding individual concepts, we can ensure that only truth values are relativized over indices, and thus a modal (omega-order) logic will suffice to capture the semantics. For this purpose, Jowsey defines two basic types e and o, where o corresponds to the type &lt;s,t&gt;, and then he defines derived types &lt;a,b&gt;, where a and b range over basic types and derived types. The logic is then made into first-order by relying on a fixed number of sorts and eliminating recursively defined types. These sorts include e, s, o, p and q, which correspond to the types e, s, &lt;s,t&gt;, &lt;e,&lt;s,t&gt;&gt; and &lt;&lt;e,&lt;s,t&gt;&gt;,&lt;s,t&gt;&gt; respectively in an omega-order logic.</Paragraph>
    <Paragraph position="9"> For a full exposition of the logic, the reader is referred to Jowsey \[1990\]. For our presentation, we 4The category (s\s)/s has the potential danger of allowing the following construct, if combined with the rule &lt;B: &amp;quot;*Mary finds a man who \[walks\]s\n p \[and he taIks\]s\s.&amp;quot; The suggestion in Steedman \[1990\] is to add a new pair of reduction rules, X \[X\]~ ffi&gt; X and conj X =&gt; \[X\]~, together with the category of &amp;quot;and&amp;quot; as conj. Thus, the category of &amp;quot;and harry talks&amp;quot; is now \[s\]t~, blocking the unwanted combination.</Paragraph>
    <Paragraph position="10"> will simplify the semantics and drop intensionality altogether. We also drop the sortal constraint, since our examples do not include belief operators and hence the only variables left are of sort e.</Paragraph>
  </Section>
  <Section position="5" start_page="211" end_page="214" type="metho">
    <SectionTitle>
3 A First-Order Unification
</SectionTitle>
    <Paragraph position="0"> We will follow the standard technique of combining the syntactic information and the semantic information as in (3.1), where up-arrow symbols (,-,)5 are used to give structures to the semantic information for partial execution (Pereira &amp; Shieber \[1987\]), which has the effect of performing some lambda reduction steps at compile time.</Paragraph>
    <Paragraph position="1">  (3.1) Basic Categories (a) n: (de'do) (b) rip: (de'do)&amp;quot; (de'ro) &amp;quot;So (c)  The term do in (3.1a) and (3.1b) encodes domain constraint for the variable de. Likewise, the term ro in (3.1b) specifies range constraint for de. The term So in (3.1b) and (3.1c) encodes the sentential constraint associated with a sentence. In order to avoid possible confusion, we shall henceforth call categories without ~emantic information &amp;quot;syntactic&amp;quot; categories.</Paragraph>
    <Paragraph position="2"> In this section, we will develop lexical entries for those coordinate constructs in (1.1) through (1.5), or variations thereof. For each case, we will start with &amp;quot;more immediate but faulty&amp;quot; solutions and present what we believe to be the correct solution in the last. (For those who want to skip to the correct lexical entries for each of the cases, they are the ones not commented out with %.) We have seen the lexical entry for sentential conjunction in (2.7d). The lexical entry for predicate conjunction can be similarly encoded, as in (3.2).</Paragraph>
    <Paragraph position="4"> When the conjoined predicates are combined with the subject noun phrase, the subject NP provides only the domain constraint, through A in the first line. The range constraints in the last two NP categories guarantee that B1 and B2 will bear the same variable X in them, so that they can be safely SNot to be confused with Montague's ha~ek symbol, '^'  put as the range constraint of the first NP category. The CLF for (1.2) from (3.2) is shown in (3.3).</Paragraph>
    <Paragraph position="5"> (3.3) exists(Xl, farmer(Xl)~(walk(Xl)~ talk(Xl))) Let us turn to noun phrase coordination, e.g., (1.3). The first try, on the model of predicate conjunction, would be: (3.4) Lexical Entry for NP Conjunction:</Paragraph>
    <Paragraph position="7"> The intention is to collect the two domain constraints via A1 and A2, to get the range constraint from D in the first line, and then to combine them by joining the two sentential constraints B and C of the domain categories. This idea however does not work, since the variables Y= and Z do not appear in the range constraint D. As a result, (3.4) will give the following ill-formed CLF for (1.3).</Paragraph>
    <Paragraph position="8"> exists (Xl, farmer (X i) &amp;talk (X3)) Rforall (X2, senator (X2) =&gt;talk (X3)) We therefore need to use distinct variables in place of D for the two range constraints which will have the same predicate symbol for their range categories. Using the Prolog predicate univ ('=.. '), we can correct (3.4) as follows: 6 (3.5) Lexical Entry for NP Conjunction:</Paragraph>
    <Paragraph position="10"> This is an explicit case of a first-order simulation of second order variables. Unfortunately, this does not work, for several reasons7 First, this handles predicates of arity 1 only, and we need to know the type of each argument if we want to provide a different category for each predicate of different arity. Second, this can not be combined with predicate coordination, for example, such as &amp;quot;john and  Prolog requires at least one of the two variables V and Fred to be already instantiated for the univ to work. This can not be expected when the noun phrase conjunction is being processed, since we don't yet know what predicate(s) will follow.</Paragraph>
    <Paragraph position="11"> a woman walk and talk,&amp;quot; or some complex verbs that may require several predicates, such as &amp;quot;believes&amp;quot;, since it assumes only one predicate for the range constraint.</Paragraph>
    <Paragraph position="12"> The solution we propose is to use the revised semantics of &amp;quot;and&amp;quot; in (1.11) instead. That is, we expect (3.6) from (1.3):</Paragraph>
    <Paragraph position="14"> We need to distinguish the variable X2 in the second line from the variable X2 in the fourth line, via something like c~ conversion, since in the present form, the Prolog will consider them as the same, while they are under distinct quantifiers.</Paragraph>
    <Paragraph position="15"> In fact, since we are separating the semantic interpretation into two levels, we can further process the CLF at the second semantic interpretation level to eliminate those spurious bindings such as exists(X, (X=u)~tu) along with variable renaming to derive the logical form (3.7) from (3.6):  (3.7) exists (Xl, farmer(Xl ) &amp;talk(Xl) ) aforall (X3, senator (X3) =&gt;talk (X3)) (3.8) produces the CLF (3.6) for (1.3).</Paragraph>
    <Paragraph position="16"> (3.8) Lexical Entry for NP Conjunction:</Paragraph>
    <Paragraph position="18"> \np: A1&amp;quot; (Y&amp;quot; (exists (X, (X=Y) &amp;D) ) ) &amp;quot;B) /np : A2&amp;quot; (Z&amp;quot; (exists (X, (X=Z) ~tD) ) ) &amp;quot;C).</Paragraph>
    <Paragraph position="19"> The reason why we are able to maintain in the two domain categories two different forms of range contraints is that the only place that will unify with the actual range constraint, i.e., the predicate, is the range constraint part of the range category only. We note in passing that Jowsey provided yet another approach to noun phrase coordination, a generalized version of his idea as shown  This approach has its limits, however, as indicated in the footnote 8.</Paragraph>
    <Paragraph position="20"> We now turn to some of the non-standard constituent coordination. First, consider (1.4), which is an instance of Right Node Raising (RNR). The CCG syntactic category of the conjunction &amp;quot;and&amp;quot; in this case is (C\C)/C, where C is s/np. (3.9) shows one derivation, among others, for (1.4). The syntactic category of &amp;quot;finds&amp;quot; is (sknp)/np. (3.9) One derivation for (1.4).</Paragraph>
    <Paragraph position="21"> harry finds and a woman cooks a musMroom ..... &gt;T ....... &gt;T s/(s\np) .... s/(s\np) ..... np</Paragraph>
    <Paragraph position="23"> % /(s:S3/np:A'(X'B2)'S2).</Paragraph>
    <Paragraph position="24"> For example, (3.11) will produce the CLF (3.12) for the sentence &amp;quot;harry finds and mary cooks a mushroom.&amp;quot; (3.12) exists(Xl,musbxoom(Xl)~find(h,Xl)&amp; cook(m,Xl)) However, this works only for pairs of proper nouns. For example, for the sentence &amp;quot;every man finds and a woman cooks a mushroom,&amp;quot; it will give the ill-formed CLF (3.13) where the domain constraint for the noun phrase &amp;quot;a woman&amp;quot; is gone and X3 is therefore unbound. This happens because the sentential constraint S2 is not utilized for the final sentential constraint.</Paragraph>
    <Paragraph position="26"> Putting the two sentential constraints Sl and s2 together as follows does not work at all, since the relation between S and SO is completely undefined, unlike the ones between S1 and B1 and between S2 and B2.</Paragraph>
    <Paragraph position="27">  (1.5) shows another case of non-standard constituent coordination, which we will call an instance of Left Node Raising (LNR). The syntactic category of &amp;quot;and&amp;quot; for LNR is (C\C)/C where C is (sknp)\(((sknp)/np)/np). (3.16) shows one syntactic derivation for (1.5). The syntactic category of &amp;quot;gives&amp;quot; is ((sknp)/np)/np. (3.16) One derivation for (1.5), fragment.</Paragraph>
    <Paragraph position="28"> every dog a bone  Sin this case, we can no longer use the disjunctive technique such as forall(Xl, (Xl= v Xl= )=&gt;give( ,X1, )) for the CLF, since Xl is now a pair. The problem gets worse when the conjoined pairs do not have the same type of quantifiers, as in (1.5).</Paragraph>
    <Paragraph position="30"> % /np:A6&amp;quot;(Z'SS)'S6))).</Paragraph>
    <Paragraph position="31"> It gives the eLF (3.19) for (1.5): (3.19) Semantics of (1.5) from (3.18): forall (Xl, dog (X 1 ) =&gt;exist s (X2, bone (X2) ~give (m, Xl ,X2) ) ) ~exist s (Xl, policeman(Xl) * exist s (X2, flo.er (X2) ~give (m, X I, X2) ) ) Unfortunately, (3.18) favors quantified nouns too much, so that when any proper noun is involved in the conjunction the constant for the proper noun will appear incorrectly in the two sentential constraints at the same time. It seems that the only way to resolve this problem is to create four variables, Y1, Y2, 7.1 and Z2, at the semantics level, similar to idea in (1.11). (3.20) implements this proposal.</Paragraph>
    <Paragraph position="32">  for which no CLF could be derived if we were using (3.18). This completes our demonstration for the technique.</Paragraph>
    <Paragraph position="33"> The natural question at this point is how many lexical entries we need for the conjunct &amp;quot;and&amp;quot;. If natural language makes every possible category conjoinable, the number of entries should be infinite, since function composition can grow categories unboundedly, if it can grow them at all. We predict that in natural language we can limit the conjunction arity to n, where n is the maximum arity in the lexicon.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML