File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/94/c94-2201_abstr.xml
Size: 21,117 bytes
Last Modified: 2025-10-06 13:48:10
<?xml version="1.0" standalone="yes"?> <Paper uid="C94-2201"> <Title>DISCONTINUITY AND THE LAMBEK @ALCULUS</Title> <Section position="1" start_page="0" end_page="1238" type="abstr"> <SectionTitle> DISCONTINUITY AND THE LAMBEK @ALCULUS </SectionTitle> <Paragraph position="0"/> <Section position="1" start_page="0" end_page="1238" type="sub_section"> <SectionTitle> Introduction </SectionTitle> <Paragraph position="0"> This paper is concerned with tile treatment of discontinuous constituency within Categorial Grammar.</Paragraph> <Paragraph position="1"> In particular, I address the problem of providing an adequate formalisation of categorial commctives l)rol)osed by Moortgat (1988), which are useful for handling certain forms of diseontimmus eonstitnency, l)espite some interesting proposals, a satisfactory logic for these e{mnectives has so far remained dnsive. I will provide such a h)gie, using an approach that falls within the general framework of labelled deductive systems (Gabbay, 1991), employing novel methods for reasoning ahout linear order in resource nsage. The approach is illusl;rated by linguistic al}plications for extraction, pied-piping and quanti\[ieation.</Paragraph> <Paragraph position="2"> The Lambek calctflus Our general fran,ework is the associative I,ambelc calculus (L: l,mnbek, 1958), a system which falls within the class of formalisms known as Categorial Grammars. The set of types is freely generated firom a set of primitiw; (atomic) types (e.g. {s, np .... }), using binary infix el)craters \, /, .. The 'meaning' of these connectives in L is fixed hy a senlanties for the logic, based on a (se,nigroup)algehra of strings (Z:,.), i.e. where * is an associative, notl-coulmtltative binary operator, with two-sided identity e, and E is the set Of non-eHlpty (-7 t:- g) strings over some vocabulary. An interpretation funcl;ion \[\[~ assigns some subset of Z; to each type, satisfying the conditions be.low for conlplex types and gyp(', sequence.s. A type comhi-.</Paragraph> <Paragraph position="3"> liar/oil X 1 ..... Xtz --> X0 h.ohls in a model ((PS,.), \[\[\]l ), if {\[x~ .... ,x.\]\]_cl\[Xol\], a.d is ~lid if it is true in all models. '1'here are several formulations of L that all realise this same meaning for the connectives.t</Paragraph> <Paragraph position="5"> \]The alternatlve formulations include e.g. seqnent (l,aml}ek 1958), proof net (l~.oor(la 199:1 ), and natural deduct.ion systems (Morrill e! aL 1990, Barry eg al. 1991). Alternative formulations carry different advantages, e.g. natural dcductlon is well suil.cd for lh,guisl.ic prcsental.ion, whereas proof nets haw~ benefits for automated theorem proving.</Paragraph> <Paragraph position="6"> l)iscontinuous type constructors The I,ambek calculus is a purely concatenative system: where any two types are combined, the string of the result is arrived at by concatenating the strings of the types combined. This point is illustrated graphically in (la,b), for the Lambek tractors, where (following Moortgat, 1991) each triangle represents a result string, and unshaded and shaded areas represent fnnetor and argutlre.nt stril/gs, respectively.</Paragraph> <Paragraph position="8"> Various linguistic phenomena, however, suggest the existence of discontinous constituency, i.e. situations the result string from combining two constituents is not produced by concatenating the component strings.</Paragraph> <Paragraph position="9"> (See e.g. Bitch, 1981.) Moortgat (1988) suggests augmenPSing \]T, with two discontinuous type construct, ors. An exh'aclion fimctor X\]Y is one whose argument corresponds to a non-peripheral (or more precisely, nor necessaribj peripheral) suhstring of the result of con> binaries, as it, (lc). An infixation fimetor XIY itself corresponds to a non-l)eril)heral substring of the result of combination, as in (ld). Given these intuitiw'~ characterisations, two options arise for the meaning of each comlecLive as to whether the point of insertion of one striug into the other is free (universal) for lixed (existential). In this paper, I will focus on the existential variants of the commctives, which appear to he the most linguistically useful, and whose interpretive conditions arc as follows: Previous proposals F, ach connective should have two inference rules: a rule of proof (showing how to derive a type containing the connective), and it rule of nse (showing how to employ such a type). This indicates a possible eight inference rules that we might hope to state (i.e. proof/use x universal/existential x infixation/extraction). V~ri~ ous attempts have I~eeu made to provide a logic for tile discoutiuuous type constructors, but all have proved ullsu('cessflll or unsatisI'actory in some way or another. Moortgat (1988), for example, uses an ordered sequent calculus framework, which allows only two of the possible eight rules to be stated: a rule of proof for existential T, and a rule of nse for universal ~. Moortgat (1.991) nses a proof method in which types are not ordered in proof representations, where linear order constraints and consequences are instead handled using a system of string labelling, i.e. types are associated with string terms, which are explicitly manipulated by inference rules. This approach allows two further rules to be stated, but the four expressible rules are distribnted one per connective, i.e. a complete logic is not given for even any one connective. As Versmissen (1991) notes, Moortgat's string label system does not allow the recording of a specific position for inserting one string into another, as would seem to be required.</Paragraph> <Paragraph position="10"> Morrill & Solias (1993) avoid this latter problem by augmenting the string \]al~elling algebra with a nonassocia(,ive pairing operator (., .), allowing labels such as {st, s2), indicating an insertion point in between sl and s2. 'lPhis system allows versions of T and ~ operators to be specified, hut ones whose interpretive definitions differ from Moortgat's. The non-associativity of pairing gives rise to limited flexibility for the system in terms of the type combinations that can be derived, and even the types that can be constructed, e.g. no flmctor (X~Y)/Z, where a \] argument is not the first sought, is allowed.</Paragraph> <Paragraph position="11"> Labelled deduction &: Lambek calculus I next develop a formulation of L which can be extended to allow for the (existential) discontinuity connectives. Our starting point is a lambda term semantics for implicational L due to Bnszkowski (1987), ba.sed on the well kuown Cnrry-Howard interpretal;ion of proofs (Iloward, 1969)3 This uses a bidirectional variant of the lambda calculus whose basic terms are directionally typed variables. If t is a term of type Y\X (resp. X/Y), and u one of type Y, then (at) t (resp. (tu)&quot;) is a term of type X. If v is a variable of type Y, and t a term of type X, then klv.t (resp. 2&quot;v.t) is a term of type Y\X (resp. X/Y). A semantics for logical formulas are regarded as types of expressions in typed lalnbda calculus, with atomic formulas corresponding to basic types, and a formula A--~B to the type of functions from A to tl. It is dmnonstrable that the set of formulas for which there exists stone correspondingly typed lambda term is precisely the theorems of the impfieatlonal fragment of intuitlonistlc logic. Thus, typed lambda calculus provides a s emantlcs for implicational intuitlonlstic logic, i.e. an independent, characterlsation of 'valid deductlon',just as the algebralc semantics of L provides an independent characterisatlon of validity for that system. Semantics for vm'ious other logics can be given in terms of classes of typed lambda terms, i.e. subsets of the typed lambda terms which satisfy certain stated criteria, van Benthem (1983) provides a lambda semantics for the system LP, a eonmmtative variant of L. Wansing (1990) provides lambda semantics for a range of subloglcs of intultlonistie logic. The Curry-tloward interpretation so permeates categorlal work that the terms &quot;formula&quot; and &quot;type&quot; have become almost interchangeable. Note that I have slightly modified BuszkowsM's notation.</Paragraph> <Paragraph position="12"> satisfy the conditions: (lll) each subterm contains a flee variable, (132) no subterm contains > 1 free occurrenee of any variable, (133) each A t (resp.)?') binds the leftmost (resp. rightmost) free variable in its scope.</Paragraph> <Paragraph position="13"> This semantics can be used in formulating (implicationM) L as a labelled deductive system (LDS: Gabbay, 1991). a LM)els are terms of the directionM lambdasystern, and propagation of labels is via application and abstraction in the standard manner. Natural deduction rules labelled in this way are as follows: We can eusure that only deductions appropriate to (implicational) L are made by requiring that the label that results with any inference is a term satisfying Buszkowski's three conditions. To facilitate testing this requirement, I use a flmction E, which maps from label terms to the string of their free variables occurring in the left-right order that follows from type directionality (giving what I call a marker term). A notion of 'string equivalence' ( ~- ) for marker terms is definecl by the axioms: (-~.1) *,(>z)+-(x.y).z (- 2) .-~..</Paragraph> <Paragraph position="14"> (-.3) .~-x.c E is recursively specified by the following clauses (where PV returns the set of fi'ee variables in a term), but it is defined for all and only those terms that satisfy Bnszkowski's three conditions. 4 Thus, we can ensure correct deduction by requiring of the label that results with each inference that there exists some marker term m such that )_'(a) = m.</Paragraph> <Paragraph position="16"> where l&quot;V(~&quot;v.( 0 C/ ~, ~(a) - p,v The followiug proofs illustrate this LDS (nsing t~m 3111 labelled deduction, each fornulla is associated with a label, which records information of the use of resources (i.e. assmnptions) in proving that formula. Inference rules indicate how labels are propagated, and may have side conditions which refer to labels, using the information recorded to ensure col recL infcrencing. Evidently, the Moortgat (1991) and Morrill & Solias (1993) formalisms arc LDSs.</Paragraph> <Paragraph position="17"> 4Condition B2 is enforced by the requirement on the application cases of E. Conditions B1 and B3 are enforced by the first and second rcquirement on the ahstraction cases of E, respectively.</Paragraph> <Paragraph position="18"> as shorthand for E(t)-m, to indicate a significant</Paragraph> <Paragraph position="20"> This system can be extended to cover product using the inference rules (3), and the additional )2 elanses shown following (with the obvious implicit extensions of the directional lambda system, and of Buszkowski's semantics). Labelling of \[.I\] inferences is via pairing, and that of \[,.El inferences llses all operator ada.pged from \]lenton el al, (1992), where a term \[b/v.w\].a implicitly represents tim suhsl, itiitiol |of b for v+w iii a. This rule is nsed hi (4).</Paragraph> <Paragraph position="21"> (a) D:,\]\[c:,I,\] a:, i~:~ ,1 A : a t3.0 : b AIB : (a, b)</Paragraph> <Paragraph position="23"> Labelled deduction gz discontinuity '\]'his al)proach can be exi;eilded l;o allow for exist.enl, ial i a.nd ,\[. These conneci;iw;s have sLatidard iinl)licational inference rules, |lsing additional distinguished operators for labelling (with superscript e for extrac- null tion and i tbr lullxat|on): (5) AtI~:~, ~:b \[1~:~}</Paragraph> <Paragraph position="25"> (lens|tier /irsl;ly how >; Iiuist; I)e extended for the abstract.ion cases of |;he new introduction rules. For a \[II\] terin such as Aev.a, l.\]|e relevant E case allows v to appear non-peripherally in the marker term of a. For a \[Ill term such as ,~v.a, v is allowed to he discontinuous in the marker of a (we shall see shortly how this is possible), bug requires its components to appear peripherally.</Paragraph> <Paragraph position="26"> where ~W(.\%..) C/ ~, ~:(a)-' ,q~.v.,e~ To allow for the new application operators, the marker system must be extended. Recall that the linear order information implicit in lahcls is projected onto the left,right dimension in markers. With 1&quot; and 1, however, |lie possibility exists that either fiuletor or argument is discontirmous in the result of their combination. For strings x G \[\[XTY\]\] and y G \[\[Y\]~, for example, we know there is some way of wrapping x around y to give a resnll, in X, but we do no~ in general know how the division of .v should be made. This problem of uncertainly ix handled by using operators L and R, where 1,(rn) and It(m)represent indefinile but complemen~ lary left and right suhcomponents of the marker term m. (L arlcl 1{ are not projection time|ions.) This idea of the significance of :1, and II, in given content via the additional axiom ( =' .d), which allows that if the eomplemenl;ary left; and right snhcomponents of a marker appear in ,q)propl:iate lel't-rightjuxtaposition, then the l~larker~s resotirees Irmy be treated as continuous. 5</Paragraph> <Paragraph position="28"> The remainii,g clauses for L; then are:</Paragraph> <Paragraph position="30"> 5rl'his axiom may I)e seen as stating the lhnlt of what cml be said cmtccrldilg 'uncerl.ainly divided' resources, i.e. only where tile unccrt.aint.y is elimina{.cd by juxtaposition can the l,,t{ oper~k\[ ol's })e ielllov(!(Ii iiiitkii/~ r bOll/(l o\[.herwise qiidden ~ l'esource visible. I,'m'ther reasonable axioms (not in practice required here) are l,(e) -::e and l/(e) ~ e, i.e. |.he only possible left and right subconlponents of ml 'mnpty' marker are likewise empty.</Paragraph> <Paragraph position="32"> (xtz)/(v,z): ::v~o~.(,4vz)*y Word order and NL semantics Labels encode both the functional structure and linear order information of proofs, and hence are used in identifying both the NL semantic and word order consequences of combinations. Label terms, however, encode distinctions not needed for NL semantics, but can easily be simplified to terms involving only a single abstraetor (A) and with application notated by simple left-right juxtaposition, e.g.</Paragraph> <Paragraph position="33"> x~A%.(~(w)&quot;)' ~ ~z~.((vz)~).</Paragraph> <Paragraph position="34"> To determine the linear order consequences of a proof with label a, we might seek a marker m consisting only of concatenated variables, where E(a) - m. These variables would be the labels of the proof's undischarged assumptions, and their order in m would provide an order for the types combined under the proof. Alternatively, for linguistic derivatlons, we mlght: substitute lexical string atoms in place of variables, and seeker a marker consisting only of concatenated string atoms, i.e. a word string. This method is adeqnate for basic L, but problems potentially arise in relation to the discontinuity connectives.</Paragraph> <Paragraph position="35"> Consider the transformation X/Y => Xi&quot;Y. The connective of the result type does not record all the linear order import of the input type's connective, and neither consequently will the application label operator for a subsequent \[\]E\]. Itowever, fl-normalisation yields a simpler label term whose operators record the linear order information originally encoded in the connectives of the types combined. For exarnple, the following proof includes a subderivation of X/Y =~ X\]Y. The overall proof term does not simply order the proof's assumptions under )3 (giving marker L(z).y.l/,(x)),' but its t-normal form (xy)&quot; does (giving x.y).</Paragraph> <Paragraph position="37"> Of course, normalisation can only bring out ordering information that is implicit in the types combined. 1:'or example, the combination XTY:~,Y:y => X:(xy) ~ is a, theorem, but the label (xy) ~ does not simply of der x and y. However, if we require that lexical subcategorisation is stated only using the standard Lambek connectives, then adequate ordering information will always be encoded in labels to allow simple ordering for linguistic derivations. Alternatively, we could allow discontinuity connectives to be used in stating lexical subcategorisation, and farther allow that lexical types be associated with complex sh'ing terms, constrncted using label operators, which encode the requisite ordering information. For example, a word w with lexical type XTY might have a string term ,Vv.(wv)&quot;, which does encode the relative ordering of w and its argument. A more radical idea is that deduction be made over lexical types together with their (possibly complex) lexical string terms, and that the testing of side conditions on inferences be done on the /3-normal form of the end label, so that the implicit ordering information of the lexical string term is 'brought out', extending proof possibilities. Then, tile lexical units of the approach are in effect partial proofs or derivationsfi Such a change would greatly extend the power of tile approach. (We shall meet a linguistic usage for this extension shortly.) Linguistic applications We shall next briefly consider some linguistic uses of the discontinuity connectives in the new approach. The most obvkms role for \] is in handling extraction (hence its name). Adapting a standard approach, a relative pronoun might have type rel/(sTnp), i.e. giving a relative clause (rel) if combined with s\]np (a 'sentence missinga NP somewhere'). Note that standard L allows only types rel/(s/np) and rel/(np\s), which are appropriate for extraction from, respectively, right and left peripheral positions only. For example, whom Mary considers _ foolish can be derived under the following proof. The atom string (6a) results via substitution of lexical string terms in the proof label, and )3. Substitntion of lexical semantics and deletion of directional distinctions gives (6b).</Paragraph> <Paragraph position="39"> b. whom' (,~u.considers / u foolish I mary I) Moortgat (1991) suggests that a (for example) sententially scoped NP quantifier could be typed s~(s~np), if inlixation and extraction could be linked so that infixation wan to the position of the 'missing np' of sTnp. r Such linkage does not follow from tile definitions of the connectives but can be implemented in the 6Thls idea invi~.es cmnparlsons to formalisms such as lea:iealised tree adjoining 9':aramar (see Joshi et al, 1991), where the basic lexical and derivatiomfl units are partial phrase structure trees associated with h:xical items.</Paragraph> <Paragraph position="40"> 7In the approach of Morrill & Solias (1993) such linkage follows automatically given the int~erpretive definitions of their connectives. Moorgat (1990,1991) proposes special purpose quantification I,ype const.rnetors.</Paragraph> <Paragraph position="41"> present approach by assigning a complex lexical string term, as in the lexical entry (<'rYeE,STmNG,SEM>): <st(sTnp), A,,.,, ~,degso,,~eone, someone'> Such a string term would result under a 'type raising' transformation sud, as: np => s~(s\[np). 'Phe example goh.u gave someone money can be derived am follows, with string and semantic results in (7).</Paragraph> <Paragraph position="42"> b. sorneone' (Av.gave' v money' john') Tllere is a sense in which this view of quantifiers seems very natural. Quantifiers behave distributionally like simple NPs, but semantically are of a higher type. I{aising the string component under tile transformation np => s.\[(slnp) resolves this incompatibility without imposing additional word order constraints.</Paragraph> <Paragraph position="43"> This aeCOllnt as stated does not allow for multiple quantitication, 8 bu~, would if lexical string terms were l;reated as part;\el proofs used in assembling larger deriwfl, ions, as suggested in the previous section.</Paragraph> <Paragraph position="44"> In interesting test case, combining both movement and scope issues, arises with pied piping construcl, ions, where a whdtem moving to clause initial position is accompanied by (or 'pied pipes') some larger phrase that conl, ains it, as in e.g. the relative clause to whom John spoke, where the PP to whom is fronted. Following Morrill & Sol\as (1993), and ultimately Morrill (1992), a treatment of pied piping can be given using \]&quot; and 1. Again, linkage of in\[ixation and extraction is achieved via complex lexical string assignmont. A l)P lfied-piping relative t~rono,m might b(' (rel/(s\]pp))l(l)plnp) allowing it to infix to a NP I,Osit\on within a Pl', giving a functor rel/(s\]pp), i.e. which prefixes to a 'sentence missing I'P' to give a relative clause. Ilence, for example, lo whom wonld have type rel/(slpp), and so Io whom ,/oh~z spoke is ~ relative clause. The lexical semantics of whom will ensure that the resulting meaning is eqnivalent to the nonq)ied piping w~riant whom John spoke to.</Paragraph> </Section> </Section> class="xml-element"></Paper>