XML Viewer - c90-2030

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/90/c90-2030_metho.xml
Size: 25,134 bytes
Last Modified: 2025-10-06 14:12:24
<?xml version="1.0" standalone="yes"?>
<Paper uid="C90-2030">
  <Title>Normal Form Theorem Proving for the Lambek Calculus ~</Title>
  <Section position="2" start_page="0" end_page="0" type="metho">
    <SectionTitle>
1 The Lambek Calculus
</SectionTitle>
    <Paragraph position="0"> The (product-free) Lambek Calculus (Lambek, 1958) is a highly flexible CG framework that can be given a Gentzen..style sequent formulation with the following  rules :23 (1) Axiom: x:f =~- x:/ Righ~ rules: F, y:i ~ x:f y:i, F ~ x:f F ----&gt; x/y:Ai.f \[/R\] F -~ x\y:Ai.f \[\rt\] Left :rules: &amp;, =.~ y:g F, x:fg, A ~ z:h I/L\] F, x/y:f, A, A =&gt; z:h ~-~ y:g F, x:fg, A =&gt; ~:h .............. \[\L\] F, A, x\y:f, A =C/- z:h Cut rule: A ==&gt; x:\] F, x:f, A --&gt; y:g \[cut\] F, A, A --'~- y:g  In this formulation, which we will call L, each type is associated with a lambda expression, corresponding to its meaning, and the rules specify tlow these are contructed. The left rules con'espond semantically to functional application, the right rules to functional abstraction. The lambda expression for the suecedent of the root sequent corresponds to its meaning as a combinaffion of the meanings of the antecedent types. This will be loosely refered to as the 'meaning (or  derivability relation, indicates that x can be derived from F. Specific to '~he Lambek Calculus, we require that each sequent has a non-empty antecedent sequence and precisely one sucredent type. We call the bottom-most sequent of any proof the root, and the inference that has the root as its conclusion the root inference. We call to the right hand premise of a left rule its major premise, its other the minor premise. In addition, we call the nudn branch of any proof that (unique) path Sl, s 2..,sn through the proof such that s I is the root sequent, sn corresponds to an axiom instance, and non of s2,..,sn is the minor premise of a left inference.</Paragraph>
    <Paragraph position="1"> reading) assigned by the proof'. We assume that each antecedent type of the initial sequent is assigned a distinct variable for its semantics. 4 The semantics of types will sometimes be omitted to save space.</Paragraph>
    <Paragraph position="2"> Lambek (1958) demonstrates that the cut rule is eliminable in the sense that all theorems of the calculus Call be proven using just the remaining rules. These remaining rules provide a decision procedure for theoremhood (since each inference removes a connective, so that search for proofs is finite), and we consider only these in the rest of the paperfi</Paragraph>
  </Section>
  <Section position="3" start_page="0" end_page="173" type="metho">
    <SectionTitle>
2 The Problem of Multiple Equivalent Proofs
</SectionTitle>
    <Paragraph position="0"> The calculus L typically allows more than one proof assigning each possible reading for a sequent. We shall consider some illustrative examples. The following two proofs assign the same reading:  z:h =:&gt; z:h y:gh ~ y:gh i/L\] y/'~:g, z:h y:gh x:fgh * :&gt; ~ x.:gh I/L\] x/y:f, y/'~:g, z:h o x:fgh y:gh ~ y:gh x:fgh ~ x:fgh \[/L\] ~:h ~ ~:h x/y:f, y:gh ::~ x:f#h \[/L\] x/y:/, y/~:g, z:h :C/ x:fgh  Here we see that the operations involved in 'building' some argument type (here y) may be performed either before or after the left inference on the functor requiring that argument (here x/y). Another example of distinct proofs assigning the same meaning is the following (in both of which the subproof for the premise x/y, y =&gt; w/(w\x) is omitted):</Paragraph>
    <Paragraph position="2"> These show a case where a right inference can equally well be made either before or after some unrelated left  of proofs aside from contingent equivalences that may stein from particular semantic assignments to antecedents. 6Moor~gat (1990) demonstrates that cut elimination preserves the strong recogrdsing capacity of the calculus in the sense that the systems with and without cut will yield precisely the same readings for any theorem modulo logical equivalence.  inference. A final example:</Paragraph>
    <Paragraph position="4"> Here we see that where the type instantiating an axiom is functional, it is also possible to 'unpack' the type and 'equate' subformulas of this in further axiom instances. The lambda expressions assigned by these two proofs are equivalent under r/-reduction.</Paragraph>
    <Paragraph position="5"> The existence of multiple equivalent proofs presents a problem for efficient theorem proving based on the calculus L. Search for proofs must be exhaustive to ensure that all different 'readings' for a given sequent are found, and a naive theorem prover will expend considerable effort constructing proofs that assign the same meaning. This radically reduces the efficiency of Lambek Calculus theorem proving. Hepple and Morrill (1989), working with a somewhat different CG framework that also admits multiple equivalent proofs, suggest that this problem be dealt with by deriving a notion of normal form (NF) for proofs, and then adapting the parsing method such that this only returns NF proofs. Khnig (1989) takes this kind approach to handling the problem of multiple equivalent proofs for Lambek sequent theorem proving, generating a notion of NF for such proofs, and deriving from this a parsing strategy intended to compute only NF proofs. :However, Khnig's parsing algorithm fails to exclude all non-NF proofs when used with a standard propositional Lambek Calculus.</Paragraph>
    <Paragraph position="6"> In this paper I define a NF system for the sequent formulation of the (product-free) Lambek Calculus, which gives rise to a parsing approach that yields only normal proofs.</Paragraph>
  </Section>
  <Section position="4" start_page="173" end_page="177" type="metho">
    <SectionTitle>
3 A New Approach
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="173" end_page="174" type="sub_section">
      <SectionTitle>
3.1 Headed Proofs
</SectionTitle>
      <Paragraph position="0"> We begin by introducing the notion of head type, as this is crucial to the approach. We use symbols P, Q, R, etc to designate proofs, and the notation P(S) to represent a proof P of the sequent S. Intuitively, the head type of a sequent S under some proof P (S) is the antecedent type of S that has widest scope semantically. The meaning assigned by a proof is always a lambda expression of the form: 6</Paragraph>
      <Paragraph position="2"> The head type here is the antecedent x\y/z since it is the semantic variable g of this that has widest ,~cope in the meaning assigned by the proof. Note the characteristic of this proof that a value subpart of the head type is 'equated' with a value subpart of the succedent type in the sense that these together iustantiate opposite sides of an axiom instance, in particular that axiom instance at the top of the main br,~nch (cf.</Paragraph>
      <Paragraph position="3"> footnote 3). This fact is the syntactic counterpart of the head type having the widest scope semantically. 7 More generally, for the head H of a sequent under some proof and the sole antecedent A of the proof's main branch axiom, it is always the case that A is a value subformula of H (or is H itself). Not all proofs have a head, as in (4), where the variable that has widest scope in the meaning assigned by the proof does not originate with an antecedent of the initial sequent, but rather with a type that is 'introduced' in a right inference:</Paragraph>
      <Paragraph position="5"> We use numerically subscripted proof symbols (e.g.</Paragraph>
      <Paragraph position="7"> proofs, in particular a proof Pn, n &gt; 1, is a headed proof, with the head being the nth member of the antecedent sequence of the root sequent. If n = 0, then the proof is unheaded. Given the relation mentioned above between the head of a proof and the main branch axiom of this, we can give a recursive characterization of the form of subscripted proofs as in (5) (where superscripted Greek letters are used to indicate sequences of given numbers of types, i.e.</Paragraph>
      <Paragraph position="8">  71&amp;quot; n corresponds to a sequence of n types): (5) Each proof Pn is of one of the forms: a. axiom where n = 1 x==~x (2) Avl..v,~.(hU1..Um) (n, m &gt;_ O)  where h is some semantic variable. The head type of the sequent under the given proof is that antecedent whose meaning is the variable h. Consider (3): b. qraCy, iv ::~ x) where ((m &gt; 0) &amp; (n =: m- 1)) \[\RI or (Cm = 03 &amp; (~ --: 0)) 71- =:~ x\y e. Qm(r k, y =v. x) where ((m : k + 1) &amp; (n --- 0)) ~k ~ x/y \[/rt\] or CC'~ &lt; k) ~ C&amp;quot; ::: &amp;quot;q) eNote that the meaning assigned by any (cut-free) proof is always be a lambda expression in/~-normal form (i.e. containing no subterms of the form (Av.P)Q) -- provided that the meanings assigned to the root sequent antecedents are in f~normal (which is the case here, since we assign these distinct variables) as may be shown by an inductive proof.</Paragraph>
      <Paragraph position="9"> 7Note the semantic significance of the main branch axiom in this. The semantics assigned by a proof P always corresponds to that of the succedent of the main branch axiom of P having undergone zero or more lambda abstractions. This follows from the semantics of the rules of L.</Paragraph>
      <Paragraph position="10">  d. R(s k.~y) q;(C/%x,C/~z) \[\L\] C/~ ~k x\y, C/ . z who,'e ((~&amp;quot; &lt; .*) a (~ = ~')) or ((j &gt; m) &amp; C n=j+k)) e. ~(~k,:~y) QsCC/'Lx,C/~) \[/L\] C/,,, ~/y, ~k, C/ =~,  where ((j _&lt; m + 1) &amp; (n = j)) or ((j &gt; m + 1) ~ (,, = i + k)) The base case for the definition is where a subproof consists only of an axiom inference, in which case the head of the proof is the single antecedent member (and hence, n = 1). From there, the position of the hexad can be kept track of by counting the number o:l' antecedents added in beside the head as subproofs are combined. Note that every cut-free proof in L is a proof P~ '.for some value of n.</Paragraph>
      <Paragraph position="11"> 82 A Constructive Notion of Normal Form In (6), a recursive definition is given for a constructive notion of NF (CNF). For this definition, it is u,.;eful to distinguish two subtypes of proof: T 1 and &amp;quot;r:!. The set of ~'1 proofs is precisely the set of CNF proofs. The 'r2 proofs are a subset of the 71 proofs-those whose main branch includes no right inferences.  (6) The set of CNF (T1) proofs is the smallest set such that: a. i\] x is an atomic type, then axiom E T 2</Paragraph>
      <Paragraph position="13"> Given (6a), CNF proofs only contain axiom leaves that are instantiated with atomic types. (6b) allows fo:c the occurrence of left inferences in CNF proofs, aI~d requires that these must be with respect to the head of the major prenfise's subproof (~major subproof'). Given (6c), every ~r 2 proof is a 71 proof also. Given (6d), only T1 proofs may have a right rule the root inference--no ~r2 proof will be of this form.</Paragraph>
      <Paragraph position="14"> Since the major subproof of a left inference must be a 7 2 proof, a right inference may never occur above sL left inference on the main branch of a CNF proof.</Paragraph>
      <Paragraph position="15"> Thus, the main branch of a CNF proof is always of the following form (starting at the root): zero or more rigilt inferences, followed by zero or more left inferences, terminating with an axiom inference. The minor subproofs of left inferences are T 1, and so are also of this general form.</Paragraph>
      <Paragraph position="16"> We will see later how we can ensure that the theorem prover generates all and only the CNF proofs of a sequent. However, for such a move to be useful, the notion of CNF must satisfy certain requirements.</Paragraph>
      <Paragraph position="17"> Most importantly, generating only CNF proofs should be safe in the sense that the theorem prover returns every distinct reading of a sequent that can be assigned. This will be the case if for every proof of a sequent, there exists a semantically equivalent CNF proof. To demonstrate this, a second notion of normal form will be defined, using the method of proof reduction, which is equivalent to the constructive notion of NF, and provably complete.</Paragraph>
    </Section>
    <Section position="2" start_page="174" end_page="176" type="sub_section">
      <SectionTitle>
3.3 A Reductive Notion of Normal Form
</SectionTitle>
      <Paragraph position="0"> A standard approach to defining NFs involves defining a contraction relation (1&gt;1) between terms. The contraction relation is stated as a number of contraction rules, each of the form X ~&gt;1 Y (in which the form on the left is termed a redex and the form on the right its contractum). Each contraction rule allows that any term containing an occurrence of a redex may be transformed into a term in which that occurrence is replaced by its contractum. A term is said to be in NF if and only if it contains no occurrences of any redex. The contraction relation generates a reduction relation (I&gt;) which is such that X reduces to Y (X i&gt; Y) if and only if Y is obtained from X by a finite series (possibly zero) of contractions. A term Y is a NF of X if and only if Y is ~NFandX ~- Y.</Paragraph>
      <Paragraph position="1">  lus We shall next consider a set of contraction rules stated on proofs in L. 8 These together define a reductive notion of NF. A total of eighteen contraction rules are required, which fall into four groups.</Paragraph>
      <Paragraph position="2"> An instance of the first group of contraction rules is shown in (7).</Paragraph>
      <Paragraph position="4"> This contraction rule expands an axiom leaf insta:ntinted with a functional type to give a subproof containing two axiom leaves, each of which is instantiated with a 'simpler' type than the original axiom (under some metric of simplicity). There is a second rule in this group for which the functional type in the redex is leftward directional.</Paragraph>
      <Paragraph position="5"> An instance of the second group of contraction rules is shown in (8). In the redex, a right inference is applied to the major premise of left inference. In the contractum, the subproof has been restructured so SProof reduction originates with Prawitz (1965).</Paragraph>
      <Paragraph position="6">  that the right inference is applied to the root sequent, and the left inference to its sole premise.</Paragraph>
      <Paragraph position="8"> There are four contraction rules of this group which arise with the directionality of the connectives for the two inferences.</Paragraph>
      <Paragraph position="9"> An instance of the third group of contraction rules is shown in (9). Note that this makes use of the subscripted proof notation, allowing us to reeognise where a left inference is with respect to the head of the stated subproof. In the subproof P,,+1(C/ '~, x, C/, v, c~ =:&gt; z), the head is the type x. It follows that the type x/y is the head of the entire redex proof, and that v/w is not. Thus we can see that in the redex, a head left inference (i.e a left inference with respect to the head of the given subproof) is applied to the major premise of non-head left inference. In the contractum, the subproof has been restructured so that the head inference is applied to the root sequent, and the non-head inference to its major premise.</Paragraph>
      <Paragraph position="11"> I/L} C/', xly, ~r&gt; C/, vlw, ~, a ~ There are eight contractio-1 rules in this group, which arise with the directionality of the connectives involved, and whether the head functor occurs to the right or left of the non-head functor.</Paragraph>
      <Paragraph position="12"> An instance of the fourth group of contraction rules is shown in (10). In the redex, a head-left inference is applied to the major premise of a non-head left inference, where the latter can be seen to in-part serve the purpose of 'building' the argument required by the head functor. In the coutractum, the inferences have been reordered so that the head inference applies to the root sequent, and the non-head inference applies to the minor premise of this.</Paragraph>
      <Paragraph position="14"> We :require four contraction rules of this pattern,  which arise with the directionality of two functors (the left-right order of the two functors being determined by the head functor).</Paragraph>
      <Paragraph position="15"> We call any proof that contains no redexes (which is consequently a NF under the reduction system) an irreducible normal form (INF). It is easily verified that for each contraction rule, the contractum subproof assigns a lambda expression for the root sequent combination equivalent to that assigned by the redex. Thus, meaning is preserved under contraction, and also in turn under reduction. Hence, an INF of any proof P always assigns the same reading as P.</Paragraph>
      <Paragraph position="16"> We will next demonstrate that normalisation holds for the reductive NF system; that is, that every proof has an INF. This property follows provided that the following (stronger) property holds: (11) Strong Normalisation: Every reduction is finite. Thus, for every proof, a finite series of contractions always leads to an irreducible proof.</Paragraph>
      <Paragraph position="17"> To prove that every reduction is finite, it is sufficient to give a metric that assigns to each proof a finite non-negative integer score, and under which it can be shown that every application of a contraction rule reduces the score of a proof by a positive integer amount. The scoring system given in (12) satisfies these requirements?</Paragraph>
      <Paragraph position="19"> score for any proof P (written sc(P )) is as follows: if P is an axiom leaf instantiated with type x then so(P) = factorial(3 ate(x)) b. ff P has a right inference at its root, wi~h premise subproof Q then sc(P) = sc(Q) + 1 c. if the root inference of P is a head left inference, with major subproof Q and minor subproof R then</Paragraph>
      <Paragraph position="21"> d. if the root inference of P is a non-head left inference, with major subproof Q and minor subproof</Paragraph>
      <Paragraph position="23"/>
    </Section>
    <Section position="3" start_page="176" end_page="176" type="sub_section">
      <SectionTitle>
3.4 Equivalence of the Two Systems
</SectionTitle>
      <Paragraph position="0"> We write CNF(P) and INF(P) to indicate that a proof is in NF under the respective systems. The two NF systems can be shown to be equivalent in that they designate precisely the same sets of proofs to be in NF, i.e: (13) Lemma h for all P, CNF(P) ~-~ INF(P) This follows given Lemmas 2 and 3.</Paragraph>
      <Paragraph position="1"> (14) Lemma2: for all P, ~INF(P) --~ ~CNF(P) It is easily checked that no redex subproof is in CNF. Hence, any reducible proof is not a CNF.</Paragraph>
      <Paragraph position="2"> (15) Lemma 3: for all P, -~CNF(P) --* -,INF(P)</Paragraph>
      <Paragraph position="4"> This may be proven by induction on P Given the equivalence of the two NF systems, and the normalisation result for the reductive NF system, it follows that for every proof in L, there exists a CNF proof that assigns the same reading. Hence, generating all and only the CNF proofs of a given sequent is safe in. the sense that we can be sure all possible readings for the combination will be returned.</Paragraph>
    </Section>
    <Section position="4" start_page="176" end_page="176" type="sub_section">
      <SectionTitle>
3.5 The Uniqueness of Normal Forms
</SectionTitle>
      <Paragraph position="0"> The approach wi!l be optimal if there is only a single CNF asslgnhlg each distinct reading. As we have already noted (footnote 6), the meaning assigned by any (cut-free) proof will be a lambda expression that is in/%normal form. Extensional equivalence for the Lambda Calculus is given by the /~\]-reduction system. Since this system exhibits the Church-Rosser property~ any two terms in fl-normal form will be equivalent just in case applying some sequence of ~/o contractions (i.e. Av.fv ~&gt;1 f) to each yields the same term (nmdulo c~-conversion).</Paragraph>
      <Paragraph position="1"> Let us consider two distinct CNF proofs P and Q of some sequent S. Assume that these differ ill some regard along their main branches. The bottom part of each main branch will consist of a sequence of zero or more right inferences. The length and nature of this sequence of inferences is fully determined by the type of the succedent in the root sequent S, and so P and Q may not differ in this regard. Thus, the meanings assigned by P and Q will be lambda expressions of the following forms (in which the lambda abstractions Av:..v,~. correspond to the initial sequence of right inferences):</Paragraph>
      <Paragraph position="3"> The remainder of each main branch will consist of some sequence of head left inferences, terminating in an axiom instance. Consequently, the two CNF proofs may differ along their main branch only in two regards: (i) a different antecedent type is the functor for the first left inference (this determining the functor for the remaining left inferences on the main branch), (ii) even if the same type is chosen for the functor of the first left inference, at some point a different ,;ubsequence of antecedents is chosen to 'construct' the argument of the functor. In either case, the semantics assigned to the two proofs will be nonequivalent. Thus, for case (i) f and g will be distinct variables. In both cases~ it may be that m : \]c, and even if ra --= \]c, at least some of the Ui and Ui I will differ in the lambda variables that m'e put together to form these (i.e. the lambda expressions corresponding to antecedent semantics). It is evident that in any of these eventualities, no sequence of ~?-contractions can yield a common term from two such terms and so the two proofs must assign distinct readings. If the two NF proofs are similar along theh&amp;quot; main branches, they must differ in the minor subproof for some left inference. The same arguments apply in turn that if these subproofs differ in form, then they must be nonequivalent. Hence, distinct CNF proofs always differ semantically, and so the uniqueness property holds.</Paragraph>
      <Paragraph position="4"> Since reduction preserves meaning, and distinct NFs are non-equivalent, it follows that every proof has a unique NF. 1deg</Paragraph>
    </Section>
    <Section position="5" start_page="176" end_page="177" type="sub_section">
      <SectionTitle>
3.6 Normal Form Theorem Proving
</SectionTitle>
      <Paragraph position="0"> For the purpose of theorem proving, we specify an amended version of the calculus, which we term the 'p~sing calculus' (PC), which is such as to only allow the construction of proofs that con'espond to CNF proofs. To this end, we move from a system which uses a single notion of sequent, to one which distinguishes two different types of sequent Type 1 and Type 2--this type being marked as a subscript on the derivability arrow =&gt;. The point of this distinction is to allow us to limit the form of the proof that may be given for some sequent merely by its type. The Type 1/Type 2 distinction essentially corresponds to the ~1/72 distinction in the definition of CNF. For Type 2 sequents we distinguish the antecedent element that must be the head of any proof of the sequent by dividing the antecedent sequence into three subparts, separated by -+-'s, as for example in C/+x+Tr =&gt; z, of which the first and last parts are (possibly empty) sequences of types, and the middle paa't a single type that is required to be the head of any proof of this sequent.</Paragraph>
      <Paragraph position="1"> The following set of inference rules allow only proofs of the required form to be constructed: (16)Axiom: e~-x:f-~e =~,,~x:f where x is an atomic type (an(l e denotes the  \[2~-*1\] type, and x := y or x A, x:f, r -~ly:g a function onto y Observe that the axiom inference specifies a Type 2 sequent, with the single antecedent type being the designated head of that sequent. This corresponds to clause (a) of the definition of CNF in (6). The left inference rules have Type 2 sequents for their conclusion and major premise and a Type 1 sequent for the minor premise. Note that the active type for the inference is the designated head of the conclusion and its value subformula is the designated head of the major premise. This corresponds to clause (b) of the CNF definition. The right rules have a Type 1 leFrom this it follows that the reductive NF system exhibits the Church-Rosser property,  premise and conclusion, in line with clause (d) of the CNF definition. In addition, there is a rule \[2~-41\], corresponding to the clause (c) of the CNF definition. Note there is a condition on the rule which does not correspond to anything hi clause (c). It is easily shown that this condition is always fulfilled by T2 proof~s, so its addition does not alter the set of proofs admitted by the PC. However, the condition will considerably limit the application of the \[2~-.1\] rule in top-down search for proofs and so increase the efficiency of theorem proving. It is easily shown that PC and CNF proofs exist in 1-to-1 correspondence.</Paragraph>
    </Section>
  </Section>
class="xml-element"></Paper>
Download Original XML