File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/93/e93-1034_metho.xml
Size: 27,639 bytes
Last Modified: 2025-10-06 14:13:17
<?xml version="1.0" standalone="yes"?> <Paper uid="E93-1034"> <Title>Tuples, Discontinuity, and Gapping in Categorial Grammar*</Title> <Section position="3" start_page="287" end_page="289" type="metho"> <SectionTitle> 2 Prosodic Labelling </SectionTitle> <Paragraph position="0"> As we shall see, the implicit coding of prosodic operations in the ordering of a sequent is not expressive enough to represent the logic of discontinuity connectives. In this connection, \[Moortgat, 1991b\] employs \[Gabbay, 1991\] notion of labelled deductive system (LDS). When we label for prosodics as well as semantics, a sequent has the form al - zi: At, ..., am - x, :Am ::~ a - C/: A where n >_ 0, no prosodic or semantic variable is associated with more than one category formula, c~ i s a prosodic term over variables {al,..., an} and C/ is a typed lambda term over (free) variables {zl,..., Zn}. The prosodically and seman2A general preparation for such multidimensional characterisation is provided by \[Oehrle, 1988\] which effectively refines Montague's program in order to provide a more even-handed treatment of linguistic dimensions. But note that Oehrle anticipates only functions as prosodic and semantic objects. Here the prosodic algebra is not marie up of functions, and nor are functions the only kind of semantic object. The symmetric treatment of prosodies and semantics concurs with the contemporary trend for 'sign-based' grammatical formalisms such as HPSG \[Pollard and Sag, 1902\], though this latter only goes so fax as recursively defining a relation between prosodic and semantic forms, i.e. representations. By interpreting categories in the way set out in \[Morrill, 1992a\] as pairings of prosodic and semantic objects we make direct reference to their properties as defined in terms of mathematical models, and use forms only in the metatheory. null tically labelled calculus is as follows. 3 to the prosodic dimension.</Paragraph> <Paragraph position="2"> .,,It F,A =:*. a+/~- (C/,C/):A..B The pattern of prosodic interpretation and prosodic labelling given above is entirely general. The interpretation scheme is called residuation. Under the scheme we define in terms of any binary operation +n complementary (or: dual) division operators \n and/n and product operator ..n by the clauses given in (5).</Paragraph> <Paragraph position="4"> As a consequence the following laws hold (see \[Lambek, 1958; Lambek, 1988; Dunn, 1991; Moortgat, 1991a; Moortgat and Morrill, 1991\]: 4 A ::t, C/riB HF- A.nB =~ C Hk B ::V A\nC (6) The LDS logic directly reflects this interpretation. It always has the following format, together with label equations in accordance with the axioms of the algebra of interpretation.</Paragraph> <Paragraph position="6"> We are free to manipulate labels according to the equations they satisfy. In the case of associative Lambek calculus there is the assoeiativity c. law; in the case of non-associative Lambek calculus there would be no equations on labels. Observe that with prosodic labelling, the structural rules permutation, contraction, and weakening are d.</Paragraph> <Paragraph position="7"> valid. In our labelling, we maintain the convention that antecedent formulas are labelled with prosodic and semantic variables. As a result each theorem al -xvA1,...,an -xn:An =:&quot; a-C/:A can be read as a Montagovian rule of formation with input categories AI,...,An and output category A and prosodic and semantic operations a and C/. Other f.</Paragraph> <Paragraph position="8"> versions of labelling allow labelling antecedent formulas with prosodic and semantic terms in general.</Paragraph> <Paragraph position="9"> However such labelling constrains the value of the elements to which the theorems apply by reference to g.</Paragraph> <Paragraph position="10"> the terms that represent them. In relation to grammar, this would mean conditioning rules on the semantic and/or prosodic form of the input. For instance, with respect to semantics, this would consti- h. tute essential reference to semantic form in the way which Montague grammar deliberately avoids. We advocate exactly the same transparency in relation 3In prosodic and semantic terms we allow omission of parenthesis under associativity, and under a convention that unary operators bind tighter than binary operators.</Paragraph> <Paragraph position="12"> an R r, A :,. a +. ~: A*.B 4In fact the residuation scheme is even more general than that which we need here: is applies to ternary 'accessibility' relations in general, not just to binary functions, i.e. deterministic ternaxy relations.</Paragraph> <Paragraph position="13"> The semantic interpretation with respect to function and Cartesian product formation can also be applied uniformly, with systematic labelling as in the previous section.</Paragraph> </Section> <Section position="4" start_page="289" end_page="290" type="metho"> <SectionTitle> 4 Discontinuity </SectionTitle> <Paragraph position="0"> Elegant as such categorial grammar is, it is more suggestive of an approach to computational linguistic grammar formalism, than actually representative of such. Amongst the various enrichments that have been proposed (see e.g. \[van Benthem, 1989; Morrill et al., 1990; Barry et al., 1991; Morrill, 1990a; Morrill, 1990b; Moortgat and Morrill, 1991; Morrill, 1992a; Morrill, 1992b\]), \[Moortgat, 1988\] advanced earlier discussion of discontinuity in e.g. \[Bach, 1981; Bach, 1984\] with a proposal for infixing and wrapping operators. The operators not only provide scope over these particular phenomena but also, as indicated in e.g. \[Moortgat, 1990\], seem to provide an underlying basis in terms of which operators for binding phenomena such as quantification and reflexivisation should be definable. The coverage of pied piping in \[Morrill, 1992b\] would also be definable in terms of these primitives, but all this depends on the resolution of certain technical issues which have been to date outstanding.</Paragraph> <Paragraph position="1"> Amongst the examples we shall be able to treat by means of our present proposals are the following.</Paragraph> <Paragraph position="2"> a. Mary rang John up. (8) b. Mary gave John the cold shoulder.</Paragraph> <Paragraph position="3"> c. John likes everything.</Paragraph> <Paragraph position="4"> d. for whom John works.</Paragraph> <Paragraph position="5"> e. John studies logic, and Charles, phonetics.</Paragraph> <Paragraph position="6"> In the particle-verb construction (8a) and discontinuous idiom (8b), the object 'John' infixes in discontinuous expressions with unitary meanings. In (8c) the quantifier must receive sentential semantic scope, and in (Sd) the pied piping must be generated, with the semantics of 'whom John works for'. In (Be), the semantics of the verb gapped in the second conjunct must be recovered from the first conjunct.</Paragraph> <Paragraph position="7"> Binary operators T and ~ are proposed in \[Moortgat, 1988\] such that BTA signifies functors that wrap around their A arguments to form Bs, and BIA signifies functors that infix themselves in their A arguments to form Bs. Assuming the semigroup algebra of associative Lambek calculus, there are two possibilities in each case, depending on whether we are free to insert anywhere (universal), or whether the relevant insertion points are fixed (existential). We leave semantics aside for the moment.</Paragraph> <Paragraph position="9"> Inspecting the possibilities of ordered sequent presentation, of the eight possible rules of inference (use and proof for each of four operators), only TjR and</Paragraph> <Paragraph position="11"> This is the partiM logic of \[Moortgat, 1988\]. Note that the absence of a rule of use for existential wrapping means that we could not generate from discontinuous elements such as ring up and give the cold shoulder which we should like to assign lexical category (N\S)TsN. (Evidently Tv would permit incorrect word order such as *'Mary gave the John cold shoulder'.) The problem with ordered sequents is that the implicit encoding of prosodic operations is of limited expressivity. Accordingly, \[Moortgat, 1991b\] seeks to improve the situation by means of explicit prosodic labelling. This does enable both rules for e.g. ~v but still does not enable the useful TjL: the remaining problem is, as noted by \[Versmissen, 1991\], that we need to have an insertion point somehow determinate from the prosodic label for an existential wrapper in order to perform a left inference.</Paragraph> <Paragraph position="12"> In \[Moortgat, 1991a\] a discontinuity product is proposed, again implicitly assuming just a semigroup algebra: 5</Paragraph> <Paragraph position="14"> As for the discontinuity divisions, ordered sequent presentation cannot express rules of both use and proof: only (r)R can be represented:</Paragraph> <Paragraph position="16"> Even using labelling, the problem for (r)L remains and is the same as that above: there is no proper management of separation points.</Paragraph> <Paragraph position="17"> In \[Moortgat, 1991a\] it is observed how the quantifying-in of infix binders such as quantifier SThe version given is actually just the existential case of two possibilities, existential and universal, as before. No rules for the universal version can be expressed in ordered sequent calculus, or labelled sequent calculus.</Paragraph> <Paragraph position="19"> Figure h Labelled rules for discontinuity operators phrases seems almost definable as SI(STN): they infix themselves at N positions in Ss (and take semantic scope at the S level - that is why they must be quantified in). And if this definability could be maintained, it would enable these operators to simulate the account of pied piping in \[Morrill, 1992b\]. None of the interpretations above however enable the expression of the requirement that the positions referred to by the two operator occurrences are the same. Our proposals will facilitate this definability, and also admit of a full (labelled) logic.</Paragraph> </Section> <Section position="5" start_page="290" end_page="291" type="metho"> <SectionTitle> 5 Tuple Control of Insertion Points </SectionTitle> <Paragraph position="0"> The present innovation rests on extending the prosodic algebra (L*,+,t) as above to an algebra (L*, +, t, (., .), 1, 2) where (., .) is a binary operation of tuple formation (introduced in \[Solias, 1992\]), with respect to which 1 and 2 behave as projection functions. Thus the algebra satisfies the conditions:</Paragraph> <Paragraph position="2"> We may in particular think of the algebra of elements V* obtained from disjoint sets V and {\[, ;, \]} by closing V under two binary operations: concatenation +, and pairing \[.; .\] where pairing can be defined as concatenation with delimitation and marking of insertion point.</Paragraph> <Paragraph position="3"> The proposal can be related to \[Moortgat and Morrill, 1991\] which also considers algebras with more than one adjunction operation (each either associative or non-associative), and defines divisions and products with respect to each by residuation.</Paragraph> <Paragraph position="4"> Note however that firstly, our tuple prosodic operation is not simply that of non-associative Lambek calculus which is characterised by the absence of any axiom (associative or otherwise), since the projection axioms entail specific conditions not imposed in the non-associative case: we might describe the tuple system as unassociative. Tupling is bijective and a prosodic object s formed by tupling records a separation point between two objects ls and 2s whereas a prosodic object formed by non-associative adjunction has no such recoverable separation point.</Paragraph> <Paragraph position="5"> Secondly, we are not primarily interested here in divisions and products based on tupling but in the combined use of the associative and unassociative operations to define discontinuity operators. (Note however that residuation with respect to tupling, as proposed in \[Solias, 1992\], would define operators suitable for verbs regarded as head-wrappers such as 'persuade'.) This brings us to the essence of the present proposals with respect to wrapping and infixing. The prosodic interpretation for the discontinuity operators is to be as follows:</Paragraph> <Paragraph position="7"> It can be seen that the operators are the residuation divisions with respect to a binary prosodic operation I defined by szIs2 = 181 + s2 + 281 just as the Lambek operators are the residuation divisions with respect to +. Use of the tuple operation collapses the former distinction between existential and universal in (9) and (10). Because pairing is bijective and tuples express a unique insertion point, there is a unique decomposition of tupled elements. Existential and universal wrappers collapse into a single wrapper and existential and universal infixers collapse into a single infixer.</Paragraph> <Paragraph position="8"> Turning to include the semantics, the type map is as is to be expected for functors and for product:</Paragraph> <Paragraph position="10"> T(A) x T(B), and as usual a category formula A is interpreted as a subset of L x T(A).</Paragraph> <Paragraph position="12"> The full prosodically and semantically labelled logic is given in Figure 1. In TL lc and 2c pick out the first and second projections of the prosodic object c in the same way that projections pick out the components of a semantic object in the eL rule of (4g); likewise in ~l~ for the projections la and 2a. The resulting prosodic forms are only simplifiable when the relevant objects are tuples. 6</Paragraph> </Section> <Section position="6" start_page="291" end_page="292" type="metho"> <SectionTitle> 6 Discontinuity Examples </SectionTitle> <Paragraph position="0"/> <Section position="1" start_page="291" end_page="291" type="sub_section"> <SectionTitle> 6.1 Phrasal Verbs </SectionTitle> <Paragraph position="0"> As a first example of discontinuity consider the particle verb case 'Mary rang John up' and the discontinuous idiom case 'Mary gave John the cold shoulder'.</Paragraph> <Paragraph position="1"> The meaning of the particle verb and the phrasal idiom resides with its elements together, which wrap around their object. The lexical assignments re-</Paragraph> <Paragraph position="3"> A derivation is given in Figure 2. The lexical prosodies and semantics of the proper names may be assumed to be atoms. For 'Mary rang John up', substitution of the lexical prosodies thus yields (18) which simplifies as shown.</Paragraph> <Paragraph position="5"> Similarly, substitution of the lexical semantics gives (19).</Paragraph> <Paragraph position="6"> ((ring-up john) mary) (19) For 'Mary gave John the cold shoulder', substitution of the lexical prosodies yields: Mary + l(gave, the + cold + shoulder) q- John + 2(gave, the + cold + shoulder) .,z Mary + gave + John + the + cold + shoulder (20) The semantics is: ((gave-tcs john) mary) (21) degHaving the projection functions defined for all prosodic objects rather then just tuple objects allows us to consider the prosodic algebra to be untyped (or: unsorted). Consequently, there is no need to check for the data type of prosodic objects such as by pattern-matching on antecedent terms (see comment above on transparency of rules). It may be possible to develop the present proposals by adding sort structure to the prosodic algebra in a manner analogous to the typing of the semantic algebra. Such sorting could be essential to defining a model theory with respect to which the calculus can be shown to be complete. Recursive nesting of infixation points does not appear to be motivated linguistically, and the present calculus does not support it. A sorted model theory which excludes the recursion might provide an interpretation with respect to which the present calculus is both sound and complete.</Paragraph> </Section> <Section position="2" start_page="291" end_page="291" type="sub_section"> <SectionTitle> 6.2 Quantifier Raising </SectionTitle> <Paragraph position="0"> In Montague grammar quantifying-in is motivated by the necessity to achieve sentential scope for all quantifiers and quantifier-scope ambiguities.</Paragraph> <Paragraph position="1"> Quantifying-in allows a quantifier phrase to apply as a semantic functor to its sentential context.</Paragraph> <Paragraph position="2"> Quantifying-in at different sentence levels enables a quantifier to take scope accordingly, and alternative orderings of quantifying-in enable quantifiers to take different scopings relative to one another. In \[Moortgat, 1990\] a binary operator ~ is defined for which the rule of use is essentially quantifying-in, so that a Montagovian treatment of quantifier-scoping is achieved by assignment of a quantifier phrase like 'something' to N~S, and assignment of determiners like 'every' to (N~S)/CN. In \[Moortgat, 1991a\] he suggests that a category such as A ~ B might be definable as B~(BTA), but notes that this definability does not hold for his definitions, for which, furthermore, the logic is problematic. On the present formulation however, these intuitions are realised. The category S~(STN) is a suitable category for a quantifier phrase such as 'everything' or 'some man', achieving sentential quantifier scope, and quantificational ambiguity.</Paragraph> <Paragraph position="3"> Assume the lexical entry (22).</Paragraph> <Paragraph position="5"> For 'John likes everything' there is the derivation in Figure 3. In this derivation, and in general, lines are included showing explicit label manipulations under equality in the prosodic algebra, in such a way that all rule instances match the rule presentations. Substitution of the lexical prosodies and semantics associates John + likes + everything with (23) which simplifies as shown.</Paragraph> <Paragraph position="7"> In this example the' quantifier is peripheral in the sentence and a category (S/N)\S could have been used in associative Lambek calculus. However, another category S/(N\S) would be needed to allow the quantifier phrase to appear in subject position, and further assignments still would be required for post-verbal position in a ditransitive verb phrase, and so on. Some generality can be achieved by assuming second-order polymorphie categories (see \[Emms, 1990\]), but note that the single assignment we have given allows appearance in all N positions without further ado, and allows all the relative quantifier scopings at S nodes.</Paragraph> </Section> <Section position="3" start_page="291" end_page="292" type="sub_section"> <SectionTitle> 6.3 Pied Piping </SectionTitle> <Paragraph position="0"> In \[Moortgat, 1991a\] and and \[Morrill, 19925\] a three-place operator is considered which is like A B, except that quantifying-in changes the category of the context expression.. \[Morrill, 1992b\] shows that this enables capture of pied piping. It follows from</Paragraph> <Paragraph position="2"> c-c:N =~ c- c:N j-j:N =~j-j:N f-f:S =~f-f.'S \L j-j:N,d-d:N\S :~j+d-(dj):S /i j - j: N, l - h (N\S)/N, c - c: N =~ j+l+c - ((1 c) j): S j -j:N, l-I:(N\S)/N =~ j+l+c+t - ((l c)j):S j -j:N, 1-h (N\S)/N =~ (j+l, t) - Ac((1 c)j):StN TR b-b:S =*. b - b:S 4L j -j: N, 1 - l: (N\S)/N, e -e: SI(STN) ::~ j+l+e+t - (e Ac((l c) j)): S j - j: N, 1 - h (N\S)/N, e - e: S~(STN) =~ j+l+e - (e Ac((l c) j)): S the nature of the present proposals that A~(BTC) presents the desired complicity between the operators. As a result, the treatment of \[Morrill, 1992b\] can be presented in these terms.</Paragraph> <Paragraph position="3"> Consider the example 'for whom John works'. The relative pronoun is lexically assigned as follows where R is the common noun modifier category CN\CN.</Paragraph> <Paragraph position="5"> There is the derivation in Figure 4. The result of prosodic substitution is for + whom + 0'ohn + works, t) (25) The result of semantic substitution is</Paragraph> <Paragraph position="7"> As for the quantification, this example is potentially manageable in just Lambek calculus. But an example where the relative pronoun is not peripheral in the pied piped material, such as 'a man a brother of whom from Brazil appeared on television' would be problematic for the same reasons as quantification.</Paragraph> <Paragraph position="8"> The solution, in terms of infixing and wrapping, is the same in the two cases, but pied piping has been a more conspicuous problem for categorial grammar because while the scoping of quantifiers can be played down, the syntactic realisation of pied piping is only too evident. In the phrase structure tradition, pied piping has been taken as strong motivation for feature percolation (see \[Pollard, 1988\]). We have seen here how discontinuity operators challenge this construal. null Categorial grammar is well-known to provide OSSibilities for 'non-constituent' coordination (see teedman, 1985; Dowty, 1988J) less accessible in the phrase structure/feature percolation approach. We turn now to another example which is glaringly problematic for all approaches, gapping. It is entirely unclear how feature percolation could engage such a construction; but as we shall see the discontinuity apparatus succeeds in doing so.</Paragraph> </Section> </Section> <Section position="7" start_page="292" end_page="294" type="metho"> <SectionTitle> 7 Gapping </SectionTitle> <Paragraph position="0"> The kind of examples we want to consider are: John studies logic, and Charles, phonetics. (27) The construction is characterised by the absence in the right hand conjunct of a verbal element, the understood semantics of which is provided by a corresponding verbal element in the left hand conjunct. Clearly, instanciations of a coordinator category schema (X\X)/X will not generate such cases of gapping. The phenomenon has attracted a fair amount of attention in categorial grammar (e.g.</Paragraph> <Paragraph position="1"> \[Steedman, 1990; Raaijmakers, 1991\]).</Paragraph> <Paragraph position="2"> The approach of \[Steedman, 1990\] aims to reduce gapping to constituent coordination; furthermore it aims to do this using just the standard division operators of categorial grammar. This involves special treatment of both the right and the left conjunct. We present our discussion in the context of the present minimal example of gapping a transitive verb TV.</Paragraph> <Paragraph position="3"> With respect to the right hand conjunct, the initial problem is to give a categorisation at all. Steedman does this by reference to a constituent formed by the subject and object with the coordinator. This constituent is essentially TV\S but with a feature a-a:N =~a-a:N c-c:PP =~c-c:PP /L f-f:PP/N, a-a:N ~f+a-(fa):PP f-f:PP/N, a-a:N =~f+a+t-(fa):PP- Trt f-f:PP/N =~(f,t)-~a(fa):PPTN j-j:N =~j-j:N k-k:S =~k-k:S \L h-h:PP =~h-h:PP j-j:N, i-i:N\S =~j+i-(ij):S j-j:N, w-w:(N\S)/PP, h-h:PP =~j+w+h-((wh)j):S/L j-j:N, w-w:(N\S)/PP, h-h: PP =~j+w+h+t-((wh)j):STR j-j:N, w-w:(N\S)/PP =C/,(j+w, t)-),h((wh)j):STPP g-g:R =~g-g:R/L d-d:R/(STPP), j-j:N, w-w:(N\S)/PP =~d+(j+w,t)-(d),h((wh)j)):R IL f-f:PP/N, o-o:(R/(SI&quot;PP))I(PPTN), j-j:N, w-w:(N\S)/PP =~f+o+t+(j+w,t)-((o~a(fa))~h((wh)j)):R m f-f:PP/N, o-o:(R/(STPP))I(PPTN), j-j:N, w-w:(N\S)/PP =~f+o+(j+w,t)-((o),a(fa)))~h((wh)j)):R both blocking ordinary application, and licensing co-ordination with a left hand conjunct of the same category. The blocking is necessary because 'and Charles, phonetics' is clearly not of category TV\S: 'Studies and Charles, phonetics' is not a sentence.</Paragraph> <Paragraph position="4"> Now, with respect to the left hand conjunct, Steedman invokes a special decomposition of 'John studies logic' analysed as S, into TV and TV\S. There is then constituent coordination between TV\S and TV\S. Finally the coordinate structure of category TV\S combines with TV on the left to give S.</Paragraph> <Paragraph position="5"> Although this treatment addresses the two problems that any account of gapping must solve, categorisation of the right hand conjunct and access of the verbal semantics in the left hand conjunct, it attempts to do so within a narrow conception of categorial grammar (only division operators) that necessiates invocation of distinctly contrived mechanisms. We believe that the radical reconstruals of grammar implicated by this analysis are not necessary given the general framework including discontinuity operators we have set out. We address for the moment just our minimal example.</Paragraph> <Paragraph position="6"> Within the context of categorial grammar we have established, the right hand conjunct is characterisable as STTV. It remains to access the understood verbal semantics from the sentence that is the left hand conjunct. In order to recover from the left hand side the information we miss on the right hand side, we would like to say that this information, the category and semantics of the verb, is made available to the coordinator when it combines with the left conjunct. In accordance with the spirit of Steedman, we can observe that the left hand conjunct contains a part with the category SI&quot;TV of the right hand constituent, but it is discontinuous, being interpolated by TV. But this is precisely what is expressed by the discontinuous product category (STTV)(r)TV. Furthermore, an element of such a category has as its semantics a pair the second projection of which is the semantics of the TV. Consequently gapping is generated by assignment of 'and' to the category (((STTV)(r)TV)\S)/(STTV) with semantics ~x~y\[(rly lr2y) A (x 7r2y)\].</Paragraph> <Paragraph position="7"> The complete derivation for (27) is as in Figure 5, where TV abbreviates (N\S)/N. When we substitute the lexical prosodics (here each just a prosodic constant) for the prosodic variables in the conclusion, we obtain the prosodic form (28).</Paragraph> <Paragraph position="8"> John + studies + logic + and (28) + ( Charles, phonetics) Similarly substituting the lexical semantics (all semantic constants except for the coordinator semantics as above), we obtain the associated semantics (29) which evaluates as shown.</Paragraph> <Paragraph position="10"> ((studies phonetics) charles)\] Some generalisation to cover different categories of gapped element and different categories of coordination is given by straightforward schematisation. In general, gapping coordinator categories have the form ((Z (r) Y)\X)/Z where Z is XTY. In this scheme, X is the category of the resulting coordinate structure and Y is the category of the gapped materiM. This allows interaction with other coordination phenomena such as node raising. For example, a referee pointed out that gapping can occur within incomplete sentences thus: 'John gave a book and Peter, a paper, to Mary'. Such a case would be covered by the instanciation where Y is the ditransitive verb category and X is S/PP.</Paragraph> <Paragraph position="11"> For generalisation including multiple gapping (several discontinuous segments elided) see \[Solids, 1992\], which employs in addition operators formed by residuation with respect to tupling. That approach has certain affinities with \[Oehrle, 1987\], and makes it possible to begin to address examples of Oehrle's relating to scope and Boolean particles. The purpose of the present paper has been to lay the groundwork for empiricM inquiry into gapping and other notorious nonconcatenative phenomena, made possible in categorial grammar by a proper treatment of discon* tinuity.</Paragraph> </Section> class="xml-element"></Paper>