<?xml version="1.0" standalone="yes"?>
<Paper uid="P98-1088">
  <Title>Memoisation for Glue Language Deduction and Categorial Parsing</Title>
  <Section position="4" start_page="0" end_page="538" type="metho">
    <SectionTitle>
2 Multiplicative Linear Logic
</SectionTitle>
    <Paragraph position="0"> Linear logic is a 'resource-sensitive' logic: in any deduction, each assumption ('resource') is used precisely once* The formulae of the multiplicative fragment of (intuitionistic) linear logic are defined by ~&amp;quot; ::= A I ~'o-~&amp;quot; J 9 v (r) ~ (A a nonempty set of atomic types). The following rules provide a natural deduction formulation:  The elimination (E) and introduction (I) rules for o-- correspond to steps of functional application and abstraction, respectively, as the term labelling reveals. The o--I rule discharges precisely one assumption (B) within the proof to which it applies. The (r)I rule pairs together the premise terms, whereas (r)E has a substitution like meaning. 1 Proofs that Wo--(Xo--Z), Xo--Y, Yo--Z =~ W and that</Paragraph>
    <Paragraph position="2"/>
    <Paragraph position="4"> The differential status of the assumptions and goal of a deduction (i.e. between F and A in F =v A) is addressed in terms of polarity: assumptions are deemed to have positive polarity, and goals negative polarity. Each Sub-formula also has a polarity, which is determined by the polarity of the immediately containing (sub)formula, according to the following schemata (where 15 is the opposite polarity to p): (i) (X p o--Y~)P (ii) (X p(r)Yp)p For example, the leftmost assumption of the first proof above has the polarity pattern</Paragraph>
    <Paragraph position="6"> the phenomenon of 'hypothetical reasoning', where additional assumptions (called 'hypotheticals') are used, which are later discharged. The need for hypothetical reasoning in a proof is driven by the types of the assumptions and goal: the hypotheticals correspond to positive polarity subformulae of the assumptions/goal that occur in the following subformula contexts:</Paragraph>
    <Paragraph position="8"> The subformula (Xo-Z) of Wo--(Xo-Z) in the proof above is an instance of context (i), so a hypothetical Z results. Subformulae that are instances of patterns (i,ii) may nest within other such instances (e.g. in ((A(r)B)(r)C)o-D, both ((A(r)B)@C) and (A(r)B) are instances of (ii)).</Paragraph>
    <Paragraph position="9"> In such cases, we can focus on the maximal pattern instances (i.e. not contained within any other), and then examine the hypotheticals produced for whether they in turn license hypothetical reasoning. This approach makes explicit the patterns of dependency amongst hypothetical elements.</Paragraph>
  </Section>
  <Section position="5" start_page="538" end_page="539" type="metho">
    <SectionTitle>
3 First-order Compilation for
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="538" end_page="539" type="sub_section">
      <SectionTitle>
Implicational Linear Logic
</SectionTitle>
      <Paragraph position="0"> Hepple (1996) shows how deductions in implicational linear logic can be recast as deductions involving only first-order formulae, using only a single inference rule (a variant of o-E). The method involves compiling the original formulae to indexed first-order formulae, where a higher-order 2 initial formula yields multiple compiled formulae, e.g. (omitting indices) Xo--(Yo--Z) would yield Xo--Y and Z, i.e. with the sub-formula Z, relevant to hypothetical reasoning, being excised to be treated as a separate assumption, leaving a first-order residue. 3 Indexing is used to ensure general linear use of resources, but also notably to ensure proper use of excised subformulae, i.e. so that Z, in our example, must be used in deriving the argument of Xo-Y, or otherwise invalid deductions would result). Simplifying Xo--(Yo--Z) to Xo--Y removes the need for an o--I inference, but the effect of such a step is not lost, since it is compiled into the semantics of the formula.</Paragraph>
      <Paragraph position="1"> The approach is best explained by example.</Paragraph>
      <Paragraph position="2"> In proving Xo--(Yo--Z), Yo-W, Wo--Z =v X, the premise formulae compile to the indexed formulae (1-4) shown in the proof below. Each of these formulae (1-4) is associated with a set containing a single index, which serves as a unique identifier for that assumption.</Paragraph>
      <Paragraph position="3">  1.</Paragraph>
      <Paragraph position="4"> 2. {j}:Z:z 2. {k}:Yo--(W:0):Au.yu 4.</Paragraph>
      <Paragraph position="5"> 5. {j, 1} :W:wz 6. {j,k,l} :Y:y(wz) 7. {i,j, k,l}: X:x( z.y(wz))</Paragraph>
      <Paragraph position="7"> The formulae (5-7) arise under combination, allowed by the single rule below. The index sets of these formulae identify precisely the assumptions from which they are derived, with appropriate indexation being ensured by the condition 7r = C/~C/ of the rule (where t2 stands for disjoint union, which enforces linear usage).</Paragraph>
      <Paragraph position="8"> C/:Ao--(B:a):)~v.a C/:B:b 7r = C/~C/ rr: A: a\[b//v\] 2The key division here is between higher-order formulae, which are are functors that seek at least one argument that bears a a functional type (e.g. Wo--(Xo--Z)), and first-order formulae, which seek no such argument. 3This 'excision' step has parallels to the 'emit' step used in the chart-parsing approaches for the associative Lambek calculus of (KSnig, 1994) and (Hepple, 1992), although the latters differs in that there is no removal of the relevant subformula, i.e. the 'emitting formula' is not simplified, remaining higher-order.</Paragraph>
      <Paragraph position="9">  Assumptions (1) and (4) both come from Xo-(Yo--Z): note how (1)'s argument is marked with (4)'s index (j). The condition c~ C C/ of the rule ensures that (4) must contribute to the derivation of (1)'s argument. Finally, observe that the rule's semantics involves not simple application, but rather by direct substitution for the variable of a lambda expression, employing a special variant of substitution, notated _\[_//_\], which specifically does not act to avoid accidental binding. Hence, in the final inference of the proof, the variable z falls within the scope of an abstraction over z, becoming bound. The abstraction over z corresponds to an o-I step that is compiled into the semantics, so that an explicit inference is no longer required. See (Hepple, 1996) for more details, including a precise statement of the compilation procedure.</Paragraph>
    </Section>
  </Section>
  <Section position="6" start_page="539" end_page="541" type="metho">
    <SectionTitle>
4 First-order Compilation for
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="539" end_page="541" type="sub_section">
      <SectionTitle>
Multiplicative Linear Logic
</SectionTitle>
      <Paragraph position="0"> In extending the above approach to the multiplicative, we will address the (r)I and @E rules as separate problems. The need for an (r)I use within a proof is driven by the type of either some assumption or the proof's overall goal, e.g. to build the argument of an assumption such as Ao-(B@C). For this specific example, we might try to avoid the need for an explicit @I use by transforming the assumption to the form Ao-Bc-C (note that the two formulae are interderivable). This line of exploration, however, leads to incompleteness, since the manoeuvre results in proof structures that lack a node corresponding to the result of the (r)I inference (which is present in the natural deduction proof), and this node may be needed as the locus of some other inference. 4 This problem can be overcome by the use of goal atoms, which are unique pseudo-type atoms, that are introduced into types by compilation (in the parlance of lisp, they are 'gensymmed' atoms). An assumption Ao-(B@C) would compile to Ao--G plus Go-Bo-C, where G is the unique goal atom (gl, perhaps). A proof using these types does contain a node corresponding to (what would be) the result of the @ inference in the natural 4Specifically, the node must be present to allow for steps corresponding to @E inferences. The expert reader should be able to convince themselves of this fact by considering an example such as Xo-((Y(r)U)~-(Z(r)U)), Yo-Z ~ X.</Paragraph>
      <Paragraph position="1"> deduction proof, namely that bearing type G, the result of combining Go--Bo-C with its arguments. null This method can be used in combination with the existing compilation approach. For example, an initial assumption Ao-((B(r)C)o--D) would yield a hypothetical D, leaving the residue Ao-(B@C), which would become Ac~-G plus Go--Bo-C, as just discussed. This method of uniquely-generated 'goal atoms' can also be used in dealing with deductions having complex types for their intended overall result (which may license hypotheticals, by virtue of realising the polarity contexts discussed in section 2). Thus, we can replace an initial deduction F =~ A with Co--A, F ~ G, making the goal A part of the left hand side. The new premise Go---A can be compiled just like any other. Since the new goal formula G is atomic, it requires no compilation. For example, a goal type Xo-Y would become an extra premise Go--(Xo--Y), which would compile to formulae Go-X plus Y.</Paragraph>
      <Paragraph position="2"> Turning next to (r)E, the rule involves hypothetical reasoning, so compilation of a maximal positive polarity subformula B(r)C will add hypotheticals B,C. No further compilation of B(r)C itself is then required: whatever is needed for hypothetical reasoning with respect to the internal structure of its subformulae will arise elsewhere by compilation of the hypotheticals B,C. Assume that these latter hypotheticals have identifying indices i, j and semantic variables x, y respectively. A rule for (r)E might combine B(r)C (with term t, say) with any other formula A (with term s, say) provided that the latter has a disjoint index set that includes i, j, to give a result that is also of type A, that is assigned semantics E~y(t, s). To be able to construct this semantics, the rule would need to be able to access the identities of the variables x, y. The need to explicitly annotate this identity information might be avoided by 'raising' the semantics of the multiplicative formula at compilation time to be a function over the other term, e.g. t might be raised to Au.E~y(t,u). A usable inference rule might then take the following form (where the identifying indices of the hypotheticals have been marked on the product  Note that we can safely restrict the rule to require that the type A of the minor premise is atomic. This is possible since firstly, the first-order compilation context ensures that the arguments required by a functor to yield an atomic result are always present (with respect to completing a valid deduction), and secondly, the alternatives of combining a functor with a multiplicative under the rule either before or after supplying its arguments are equivalent. 5 In fact, we do not need the rule above, as we can instead achieve the same effects using only the single (o--) inference rule that we already have, by allowing a very restricted use of type polymorphism. Thus, since the above rule's conclusion and minor premise are the same atomic type, we can in the compilation simply replace a formula XNY, with an implication .Ao---(.A: {i,j}), where ,4 is a variable over atomic types (and i,j the identifying indices of the two hypotheticals generated by compilation). The semantics provided for this functor is of the 'raised' kind discussed above. However, this approach to handling (r)E inferences within the compiled system has an undesirable characteristic (which would also arise using the inference rule discussed above), which is that it will allow multiple derivations that assign equivalent proof terms for a given type combination. This is due to non-determinism for the stage at which a type such as Ao---(A: {i,j}) participates in the proof. A proof might contain several nodes bearing atomic types which contain the required hypotheticals, and Ao-(al: {i, j}) might combine in at any of these nodes, giving equivalent results. 6 The above ideas for handling the multiplicative are combined with the methods developed</Paragraph>
      <Paragraph position="4"> The move of requiring the minor premise to be atomic effects a partial normalisation which involves not only the relative ordering of (r)E and o--E steps, but also that between interdependent (r)E steps (as might arise for an assumption such as ((ANB)(r)C)). It is straightforward to demonstrate that the restriction results in no loss of readings. See (Benton et al., 1992) regarding term assignment and proof normalisation for linear logic.</Paragraph>
      <Paragraph position="5"> 6It is anticipated that this problem can be solved by using normalisation results as a basis for discarding partial analyses during processing, but further work is required in developing this idea.</Paragraph>
      <Paragraph position="6"> for the implicational fragment to give the compilation procedure (~-), stated in Figure 1. This takes a sequent F =&gt; A as input (case T1), where A is a type and each assumption in F takes the form Type:Sere (Sere minimally just some unique variable), and it returns a structure (~, C/, A}, where ~ is a goal atom, C/ the set of all identifying indices, and A a set of indexed first order formulae (with associated semantics).</Paragraph>
      <Paragraph position="7"> Let A* denote the result of closing A under the single inference rule. The sequent is proven iff (C/, ~, t) E A* for some term t, which is a complete proof term for the implicit deduction. The statement of the compilation procedure here is somewhat different to that given in (Hepple, 1996), which is based on polar translation functions. In the version here, the formula related cases address only positive formulae. T As an example, consider the deduction Xo--Y, Y(r)Z =&gt; XNZ. Compilation returns the goal atom gO, the full index set {g, h, i, j, k, l}, )lus the formulae show in (1-6) below.</Paragraph>
      <Paragraph position="8">  1. ({9},gOo-(gl: {h}),At.t) 2. ({h},glo-(X:O)o-(Z:O),AvAw.(w (r)v)) 3. ({i},Xo-(Y:O),kx.(ax)) 4. ({j},A~-(A: {k, 0), ~.E~z(b, u)&gt; 5. {{k},Y,y} 6. ({/},Z,z) 7. ({i, k}, X, (ay)) \[3+5\] 8. &lt;{h,l},glo---(X:O),Aw.(w(r)z)) \[2+6\] 9. {{h,i, k,l}, gl, ((ay) (r) z)) \[7+8\] 10. ({h,i,j,k,l},gl, E~z(b,((ay)(r)z))) \[4+9\] 11. ({g,h,i,j,k,l},gO, E~(b,((ay)(r)z))) \[1+11\] 12. {{g, h,i, k, l}, gO, ((ay) (r) z)) \[1+9\] 13. ({9, h,i,j,k,l},gO, E~(b,((ay) Nz))) \[4+12\]  The formulae (7-13) arise under combination.</Paragraph>
      <Paragraph position="9"> Formulae (11) and (13) correspond to successful overall analyses (i.e. have type gO, and are labelled with the full index set). The proof illustrates the possibility of multiple derivations 7Note that the complexity of the compilation is linear in the 'size' of the initial deduction, as measured by a count of type atoms. For applications where the formulae that may participate are preset (e.g. they are drawn from lexicon), formulae can be precompiled, although the results of precompilation would need to be parametised with respect to the variables/indices appearing, with a sufficient supply 'fresh' symbols being generated at time of lexical access, to ensure uniqueness.</Paragraph>
      <Paragraph position="11"> where Y is atomic; x a fresh variable; 7-((4, X1, (sx))) = (4, X2, t) +ttJF (T5) 7-((4, Xo-((Yo-Z): C/), s)) = 7-((C/, Xo-(Y: ~r), Ay.s()~z.y))) U 7-((i, Z, z)) where i a fresh index; y, z fresh variables; 7r = i U C/ (7-6) 7-((4, Xo-((Y (r) Z): C/), s)) = 7-((4, Xo-(G: ~), s)) u 7-((i, ~o-Yo-Z, ~z~y.(y (r) z))) where i a fresh index; G a fresh goal atom; y, z fresh variables; 7r = i U</Paragraph>
      <Paragraph position="13"> where i, j fresh indices; x, y, t fresh variables; .4 a fresh variable over atomic types  assigning equivalent readings, i.e. (11) and (13) have identical proof terms, that arise by non-determinism for involvement of formula (4).</Paragraph>
    </Section>
  </Section>
  <Section position="7" start_page="541" end_page="541" type="metho">
    <SectionTitle>
5 Computing Exclusion Constraints
</SectionTitle>
    <Paragraph position="0"> The use of inclusion constraints (i.e. requirements that some formula must be used in deriving a given functor's argument) within the approach allows us to ensure that hypotheticals are appropriately used in any overall deduction and hence that deductions are valid. However, the approach allows that deduction can generate some intermediate results that cannot be part of an overall deduction. For example, compiling a formula Xo--(Yo--(Zo--W))o--(Vo-W) gives the first-order residue Xo-Yo--V, plus hypotheticals Zo-W and W. A partial deduction in which the hypothetical Zo-W is used in deriving the argument V of Xo--Yo-V cannot be extended to a successfull overall deduction, since its use again for the functor's second argument Y (as an inclusion constraint will require) would violate linear usage. For the same reason, a direct combination of the hypotheticals Zo-W and W is likewise a deductive dead end.</Paragraph>
    <Paragraph position="1"> This problem can be addressed via exclusion constraints, i.e. annotations to forbid stated formulae having been used in deriving a given funtor's argument, as proposed in (Hepple, 1998). Thus, a functor might have the form Xo---(Y:{i}:{j}) to indicate that i must appear in its argument's index set, and that j must not.</Paragraph>
    <Paragraph position="2"> Such exclusions can be straightforwardly computed over the set of compiled formulae that derive from each initial assumption, using simple (set-theoretic) patterns of reasoning. For example, for the case above, since W must be used in deriving the argument V of the main residue formula, it can be excluded from the argument Y of that formula (which follows from the disjointness condition on the single inference rule). Given that the argument Y must include Zo--W, but excludes W, we can infer that W cannot contribute to the argument of Zo--W, giving an exclusion constraint that (amongst other things) blocks the direct combination of Zo--W and W. See (Hepple, 1998) for more details (although a slightly different version of the first-order formalism is used there).</Paragraph>
  </Section>
  <Section position="8" start_page="541" end_page="542" type="metho">
    <SectionTitle>
6 Tabular Deduction
</SectionTitle>
    <Paragraph position="0"> A simple algorithm for use with the above approach, which avoids much redundant computation, is as follows. Given a possible theorem to prove, the results of compilation (i.e. indexed types plus semantics) are gathered on an agenda. Then, a loop is followed in which an item is taken from the agenda and added to the database (which is initially empty), and then the next triple is taken from the agenda and  so on until the agenda is empty. Whenever an entry is added to the database, a check is made to see if it can combine with any that are already there, in which case new agenda items are generated. When the agenda is empty, a check is made for any successful overall analyses. Since the result of a combination always bears an index set larger than either parent, and since the maximal index set is fixed at compilation time, the above process must terminate.</Paragraph>
    <Paragraph position="1"> However, there is clearly more redundancy to be eliminated here. Where two items differ only in their semantics, their subsequent involvement in any further deductions will be precisely parallel, and so they can be collapsed together. For this purpose, the semantic component of database entries is replaced with a unique identifer, which serves as a 'hook' for semantic alternatives. Agenda items, on the other hand, instead record the way that the agenda item was produced, which is either 'presupplied' (by compilation) or 'by combination', in which case the entries combined are recorded by their identifiers. When an agenda item is added to the database, a check is made for an entry with the same indexed type. If there is none, a new entry is created and a check made for possible combinations (giving rise to new agenda items). However, if an appropriate existing entry is found, a record is made for that entry of an additional way to produce it, but no check made for possible combinations. If at the end there is a successful overall analsysis, its unique identifier, plus the records of what combined to produce what, can be used to enumerate directly the proof terms for successful analyses.</Paragraph>
  </Section>
class="xml-element"></Paper>