<?xml version="1.0" standalone="yes"?> <Paper uid="W05-1503"> <Title>Switch Graphs for Parsing Type Logical Grammars</Title> <Section position="4" start_page="0" end_page="23" type="metho"> <SectionTitle> 2 Lambek's Associative Calculus </SectionTitle> <Paragraph position="0"> Lambek's associative calculus L (Lambek 1958) contains three connectives: concatenation, left division, and right division. Logically, concatenation is conjunction and the divisions are directed implications. Algebraically, concatenation is a free semigroup product and the divisions its left and right residuals. Viewed as a purely syntactic formalism, L assigns syntactic types to linguistic expressions modeled as sequences of tokens. From a stipulated lexical assignment of expressions to syntactic types, further assignments of expressions to types are derived through purely logical inference, with the logic providing a sound and complete axiomatization of, and inference system over, the algebraic structure (Pentus 1995).</Paragraph> <Paragraph position="1"> L appears near the bottom of a hierarchy of substructural logics obtained by dropping structural rules: Lambek proofs are valid as multiplicative intuitionistic linear proofs (restoring permutation), which are valid as conjunctive and implicative relevance proofs (restoring contraction), which are valid as conjunctive and implicative intuitionistic proofs (restoring weakening). In type logical grammars, lexical entries are associated with syntactic types and intuitionistic (in fact relevant) proofs as semantic representations, notated as terms of the simply typed λ-calculus with product under the Curry-Howard correspondence. The semantics of a derived expression is the result of substituting the lexical semantics into the reading of the derivation as an intuitionistic proof.</Paragraph> <Section position="1" start_page="18" end_page="18" type="sub_section"> <SectionTitle> 2.1 Syntactic and Semantic Types </SectionTitle> <Paragraph position="0"> The set of syntactic types is defined recursively on the basis of a set SynAtom of atomic syntactic types.</Paragraph> <Paragraph position="1"> The full set SynTyp of syntactic types is the least set containing the atomic syntactic types SynAtom and closed under the formation of products (SynTyp * SynTyp), left divisions (SynTyp\SynTyp), and right divisions (SynTyp/SynTyp). The two division, or &quot;slash&quot;, types, A/B, read A over B, and B\A, read B under A, refine the semantic function types by providing a directionality of the argument with respect to the function. A linguistic expression assigned to type A/B combines with an expression of type B on its right to produce an expression of type A. An expression of type B\A combines with an expression of syntactic type B on its left to produce an expression of type A. The product syntactic type A*B is assigned to the concatenation of an expression of type A to an expression of type B. The distinguishing feature of the Lambek calculus with respect to the earlier categorial grammar of Bar-Hillel is that, as well as the familiar cancellation (modus ponens) rules, it also admits a form of the deduction theorem: if the result of concatenating an expression e to each expression of type B results in an expression of type A, then it follows that e is assigned to syntactic type A/B.</Paragraph>
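The recursive definition of SynTyp can be rendered directly as a small algebraic datatype. The following is a minimal sketch (Python); the class names, constructor fields, and example categories are our own illustration, not notation from the paper.

```python
# A minimal sketch of SynTyp as a recursive datatype; the names are ours.
from dataclasses import dataclass

class SynTyp:
    """Base class of syntactic types."""

@dataclass(frozen=True)
class Atom(SynTyp):
    name: str                 # an element of SynAtom

@dataclass(frozen=True)
class Over(SynTyp):           # A/B: yields an A given a B on the right
    result: SynTyp
    arg: SynTyp

@dataclass(frozen=True)
class Under(SynTyp):          # B\A: yields an A given a B on the left
    arg: SynTyp
    result: SynTyp

@dataclass(frozen=True)
class Product(SynTyp):        # A*B: an A concatenated with a B
    left: SynTyp
    right: SynTyp

# Some familiar category assignments, written with these constructors.
N, S, CN = Atom("N"), Atom("S"), Atom("CN")
iv = Under(N, S)                    # N\S, an intransitive verb
det = Over(N, CN)                   # N/CN, a determiner
raised = Over(S, Under(N, S))       # S/(N\S), a type-raised noun phrase
```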
<Paragraph position="2"> Semantic representations in Lambek type logical grammar are simply typed λ-terms with product. We assume a set SemAtom of atomic semantic types, which generate the usual function types σ → τ and product types σ × τ. Terms are grounded on an infinite set of distinct variables Varσ, along with a set of distinct constants Conσ, for each type σ. We assume the usual λ-terms consisting of variables, constants, function applications α(β), function abstractions λx.α, pairs ⟨α, β⟩, and projections π1δ and π2δ onto the first and second elements of the pair respectively. We say that a term α is closed if and only if it contains no free variables.</Paragraph> <Paragraph position="3"> A type map consists of a mapping typ : SynAtom → SemTyp. That is, each atomic syntactic type A ∈ SynAtom is assigned to a (not necessarily atomic) semantic type typ(A) ∈ SemTyp. Semantic types are assigned to complex syntactic types as follows:</Paragraph> <Paragraph position="4"> typ(A/B) = typ(B) → typ(A); typ(B\A) = typ(B) → typ(A); typ(A*B) = typ(A) × typ(B).</Paragraph> <Paragraph position="5"> We will often write α : A where α is a λ-term of type typ(A).</Paragraph> </Section> <Section position="2" start_page="18" end_page="18" type="sub_section"> <SectionTitle> 2.2 Linguistic Expressions and the Lexicon </SectionTitle> <Paragraph position="0"> In the Lambek calculus, linguistic expressions are modeled by sequences of atomic symbols. These atomic symbols are drawn from a finite set Tok of tokens. The full set of linguistic expressions Tok* is the set of sequences of tokens. For the sake of this short version of the paper we admit the empty sequence; we will address its exclusion (as in the original definition of L) in a longer version.</Paragraph> <Paragraph position="1"> The compositional assignment of semantic terms to linguistic expressions is grounded by a finite set of assignments of terms and types to expressions.</Paragraph> <Paragraph position="2"> A lexicon is a finite relation Lex ⊆ Tok* × Term × SynTyp, where all ⟨w, α, A⟩ ∈ Lex are such that the semantic term α is of the appropriate type for the syntactic type A. We assume that the only terms used in the lexicon are relevant, in the sense of relevance logic, in not containing vacuous abstractions. Note that the set of atomic semantic types, the set of atomic syntactic types, and the semantic type mapping are assumed as part of the definition of a lexicon. Type logical grammar is an example of a fully lexicalized grammar formalism in that the lexicon is the only locus of language-specific information.</Paragraph> </Section>
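To make the type map and the lexicon concrete, here is a minimal sketch (Python); the tuple encodings, the particular atomic type assignments, and the informal term strings are our own assumptions rather than anything stipulated in the paper.

```python
# A sketch of a type map and a small lexicon; the encodings are ours.
# Syntactic types as tuples: ('atom', a), ('/', A, B), ('\\', B, A), ('*', A, B).
N, S, CN = ('atom', 'N'), ('atom', 'S'), ('atom', 'CN')

# Stipulated map on atomic syntactic types (e and t are atomic semantic types).
ATOM_TYP = {'N': 'e', 'S': 't', 'CN': ('->', 'e', 't')}

def typ(syn):
    """Extend typ from SynAtom to all of SynTyp, following the equations above."""
    tag = syn[0]
    if tag == 'atom':
        return ATOM_TYP[syn[1]]
    if tag == '/':                          # typ(A/B) = typ(B) -> typ(A)
        _, a, b = syn
        return ('->', typ(b), typ(a))
    if tag == '\\':                         # typ(B\A) = typ(B) -> typ(A)
        _, b, a = syn
        return ('->', typ(b), typ(a))
    if tag == '*':                          # typ(A*B) = typ(A) x typ(B)
        _, a, b = syn
        return ('x', typ(a), typ(b))
    raise ValueError(f"not a syntactic type: {syn!r}")

# A lexicon as a finite relation over token sequences, terms and syntactic
# types; terms are shown informally as strings rather than structured terms.
Lex = [
    (('Kim',),   'kim',                            N),
    (('runs',),  'lambda x. run(x)',               ('\\', N, S)),
    (('every',), 'lambda p. lambda q. every(p, q)', ('/', ('/', S, ('\\', N, S)), CN)),
]

print(typ(('\\', N, S)))    # ('->', 'e', 't'): an intransitive verb denotes a property
```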
<Section position="3" start_page="18" end_page="19" type="sub_section"> <SectionTitle> 2.3 Proof Nets </SectionTitle> <Paragraph position="0"> A sequent Γ ⇒ α : A is formed from an antecedent Γ consisting of a (possibly empty) sequence of λ-term and syntactic type pairs, and a consequent pair α : A, where the terms are of the appropriate types for the syntactic types. Following Roorda (1991), we define theoremhood with Girard-style proof nets (Girard 1987), a geometric alternative to Lambek's Gentzen-style calculus (Lambek 1958).</Paragraph> <Paragraph position="1"> Proof nets form a graph over nodes labeled by polar types, where a polar type is the combination of a syntactic type and one of two polarities, input (negative) and output (positive). We write A• for the input polar type, which corresponds to antecedent types and is thus logically negative. We write A◦ for the output polar type, which is logically positive and corresponds to a consequent type. A literal is a polar type with an atomic syntactic type. Where A is an atomic syntactic type, the literals A• and A◦ are said to be complementary.</Paragraph> <Paragraph position="2"> Each polar type defines an ordered binary tree rooted at that polar type, known as a polar tree. For a literal, the polar tree is a single node labeled by that literal. For polar types with complex syntactic types, the polar tree is rooted at the polar type and unfolded upwards based on connective and polarity according to the solid lines in Figure 1, which also includes other annotation. Examples for some linguistically motivated types are shown in Figure 2.</Paragraph> <Paragraph position="3"> The solid edges of the graphs are the edges of the logical links. Each unfolding is labeled with a multiplicative linear logic connective, either multiplicative conjunction (⊗) or multiplicative disjunction (℘). This derives from the logical interpretation of the polar type trees as formula trees in multiplicative linear logic. Unfolding the Lambek connectives to their linear counterparts, (A/B)◦ and (B\A)◦ unfold to A◦ ℘ B•; (A/B)• and (B\A)• unfold to A• ⊗ B◦; (A*B)◦ unfolds to A◦ ⊗ B◦; and (A*B)• unfolds to A• ℘ B•. The type unfoldings correspond to the classical equivalences between (φ → ψ) and (¬φ ∨ ψ), between ¬(φ → ψ) and (φ ∧ ¬ψ), and between ¬(φ ∧ ψ) and (¬φ ∨ ¬ψ). For atomic syntactic types A, A◦ becomes simply A, whereas A• becomes its linear negation A⊥; this is the sense in which polar atomic types correspond to logical literals. The non-commutative nature of the Lambek calculus is reflected in the ordering of the subtrees in the unfoldings; for commutative logics, the proof trees are not ordered.</Paragraph> <Paragraph position="4"> The proof frame for a syntactic sequent C1, ..., Cn ⇒ C0 is the ordered sequence of polar trees rooted at C0◦, C1•, ..., Cn•. We convert sequents to frames in this order, with the output polar tree first. In general, what follows applies to any cyclic reordering of these polar trees. Note that the antecedent types C1, ..., Cn have input (negative) polarity and the consequent type C0 has output (positive) polarity. All of our proof frames are intuitionistic in that they have a single output conclusion, i.e. a unique polar tree rooted at an output type.</Paragraph> <Paragraph position="5"> A partial proof structure consists of a proof frame together with a set of axiom links linking pairs of complementary literals, with at most one link per literal. Axiom links in both directions are shown in Figure 3.</Paragraph> <Paragraph position="6"> A proof structure is a partial proof structure in which all literals are connected to complementary literals by axiom links.</Paragraph> <Paragraph position="7"> Proof nets are proof structures meeting certain conditions. A proof structure is planar if and only if its axiom links can be drawn in the half-plane without crossing lines; this condition enforces the non-commutativity of the Lambek calculus. The final condition on proof structures involves switching. A switching of a proof structure is a subgraph that arises from the proof structure by removing exactly one edge from each disjunctive (℘) link. A proof structure is said to be Danos-Regnier (DR-) acyclic if and only if each of its switchings is acyclic (Danos and Regnier 1989). (The correctness criterion for linear logic requires every switching to be acyclic and connected; Fadda and Morrill (2005) show that for the intuitionistic case, i.e. a single output conclusion, as for L, DR-acyclicity entails the connectedness of every switching.) A proof net is a planar DR-acyclic proof structure. A theorem is any sequent forming the proof frame of a proof net.</Paragraph>
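The switching condition can be checked directly, if inefficiently, by enumerating switchings. The sketch below (Python) is only meant to make the definition concrete; the graph encoding and the rendering of the type-raising net N ⇒ S/(N\S), one of the examples discussed just below, are our own reading, and Section 3's switch graphs exist precisely to avoid this brute-force check.

```python
# Brute-force Danos-Regnier acyclicity. Each edge is (node1, node2, tag), with
# tag None for axiom and tensor edges, or (par_id, 'L') / (par_id, 'R') for the
# left / right edge of a par link.
from itertools import product

def acyclic(nodes, edges):
    """True if the undirected (multi)graph has no cycle (union-find)."""
    parent = {n: n for n in nodes}
    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x
    for a, b, _ in edges:
        ra, rb = find(a), find(b)
        if ra == rb:
            return False            # this edge closes a cycle
        parent[ra] = rb
    return True

def dr_acyclic(nodes, edges):
    """Every switching (keep exactly one edge of each par link) is acyclic."""
    par_ids = sorted({tag[0] for _, _, tag in edges if tag is not None})
    for keep in product('LR', repeat=len(par_ids)):
        chosen = dict(zip(par_ids, keep))
        switching = [e for e in edges
                     if e[2] is None or chosen[e[2][0]] == e[2][1]]
        if not acyclic(nodes, switching):
            return False
    return True

# The net for N => S/(N\S), as we read it: a par link (id 1) for the output
# division, a tensor link for the (N\S) inside it, and two axiom links.
nodes = ['S/(N\\S)o', 'So', '(N\\S)i', 'Si', 'No', 'Ni']
edges = [
    ('S/(N\\S)o', 'So',      (1, 'L')),
    ('S/(N\\S)o', '(N\\S)i', (1, 'R')),
    ('(N\\S)i',   'Si',      None),
    ('(N\\S)i',   'No',      None),
    ('So', 'Si', None),
    ('No', 'Ni', None),
]
print(dr_acyclic(nodes, edges))   # True: cyclic overall, but every switching is acyclic
```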
<Paragraph position="8"> Consider the three proof nets in Figure 4. The first example has no logical links, and corresponds to the simplest sequent derivation, S ⇒ S. The second example represents a determiner, noun and intransitive verb sequence. Both of these examples are acyclic, as must be every proof net with no logical ℘-links.</Paragraph> <Paragraph position="9"> The third example corresponds to the type-raising sequent N ⇒ S/(N\S). Unlike the other examples, this proof net involves a ℘-link and is cyclic. But both of its switchings are acyclic, so it satisfies the Danos-Regnier acyclicity condition.</Paragraph> </Section> <Section position="4" start_page="19" end_page="23" type="sub_section"> <SectionTitle> 2.4 Essential Nets and Semantic Trips </SectionTitle> <Paragraph position="0"> A term is said to be pure if and only if it contains no constants. The linear terms are closed, pure λ-terms that bind each variable exactly once. Each proof net in the Lambek calculus corresponds to a linear (i.e. binding each variable exactly once) λ-term via the Curry-Howard correspondence. This term abstracts over variables standing in for the semantics of the inputs in the antecedent of the sequent and has a body that is determined by the consequent of the sequent. For instance, the λ-term λx.λP.P(x) corresponds to the sequent x : N ⇒ λP.P(x) : S/(N\S). The λ-term induced by the Curry-Howard correspondence can be determined by a unification problem over a proof net (Roorda 1991). Different proof nets for the same theorem correspond to different interpretations through the Curry-Howard correspondence. The essential net of a proof structure is the directed graph rooted at the root node of the output polar type tree whose edges are shown as dashed lines in Figures 1 and 3 (LaMarche 1994). Each output division type introduces a fresh variable on its input subtype (its argument), as indicated by the labels xi in Figure 1.</Paragraph> <Paragraph position="5"> The essential nets for the examples in Figure 4 are shown in Figure 5.</Paragraph> <Paragraph position="6"> Terms are computed for the polar type trees by assigning terms to the roots of the polar inputs. The tree is then unfolded, unifying in substitutions as it goes, as illustrated in the example polar type trees in Figure 2. The axiom links of the essential net provide the substitutions necessary to solve the unification problem of λ-terms in the proof net established by equating the two halves of each axiom-linked complementary pair of literals. A traversal of an essential net carrying out the substitutions specified by axiom links constitutes a semantic trip, the end result of which is the Curry-Howard λ-term for the Lambek calculus theorem derived by the proof net. All λ-terms derived from a semantic trip with variables or constants assigned to input root polar types will be in β-η long form. The essential net directly corresponds to the tree of the semantic term derived by the Curry-Howard correspondence.</Paragraph> <Paragraph position="7"> The well-formedness of a set of axiom linkings over a polar tree may be expressed in terms of the essential net. Among the conditions are that an essential net must be acyclic and planar. In addition, essential nets must be connected in two ways. First, there must be a path from the root of the single output polar tree to the root of each of the input polar trees.
Second, there must be a path from each output daughter of an output division to its input daughter. That is, when (A/B)◦ is unfolded into the daughters A◦ and B•, there must be a path from A◦ to B•. These conditions express the definition of linear semantic terms dictated through the logic by the Curry-Howard correspondence. The first condition requires each variable (or term) corresponding to the root of an input polar tree to occur in the output term, whereas the second condition requires that variables only occur within their proper scopes, so that they are bound. The essential nets presented in Figure 5 adhere to these conditions and produce well-typed linear λ-terms. The example presented in Figure 6 shows a set of axiom links that does not form a proof net; it violates the condition on variable binding, as is seen from the lack of a path from the N◦ daughter to the N• daughter of the (N/N)◦ node. The major drawback to using these conditions directly in parsing is that they are existential, in the sense of requiring the existence of a certain kind of path, and thus difficult to refute online during parsing. In comparison, the Danos-Regnier acyclicity condition is violated by the attempt to close off the binding of the variable. The path violating DR-acyclicity is shown in Figure 7, with the path given in dashed lines and the switching taking the right daughter of (N/N)◦ as the arc to remove.</Paragraph> </Section> </Section>
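Stated over an explicit graph, the two connectedness conditions amount to simple reachability checks. A minimal sketch follows (Python); the encoding of the essential net as an adjacency dict, and the assumption that acyclicity and planarity are checked separately, are ours.

```python
# A sketch of the two essential-net connectedness conditions.
def reachable(graph, start):
    """All nodes reachable from start by following directed (dashed) edges."""
    seen, stack = set(), [start]
    while stack:
        n = stack.pop()
        if n not in seen:
            seen.add(n)
            stack.extend(graph.get(n, ()))
    return seen

def essential_net_connected(graph, output_root, input_roots, output_divisions):
    """output_divisions lists (output_daughter, input_daughter) pairs, one per
    output division node, e.g. the A-output and B-input daughters of (A/B)-output."""
    from_root = reachable(graph, output_root)
    if not all(root in from_root for root in input_roots):       # condition 1
        return False
    return all(in_d in reachable(graph, out_d)                   # condition 2
               for out_d, in_d in output_divisions)
```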
<Section position="5" start_page="23" end_page="26" type="metho"> <SectionTitle> 3 Parsing with Switch Graphs </SectionTitle> <Paragraph position="0"> The planar connection of all literals into a proof structure is straightforward to implement. Axiom links are simply added in such a way that planarity is maintained until a complete linkage is found. In our shift-reduce-style parser, planarity is maintained by a stack in the usual way (Morrill 2000). For dynamic programming, we combine switch graphs in the cells of a Cocke-Kasami-Younger (CKY) parser (Morrill 1996). The main challenge is enforcing DR-acyclicity, and this is the main focus of the rest of the paper. We introduce switch graphs, which not only maintain DR-acyclicity, but also lead the way to a normal form for well-formed subsequence fragments of a partial proof structure. This normal form underlies the packing of ambiguities in subderivations in exactly the same way as usual in dynamic programming parsing.</Paragraph> <Section position="1" start_page="23" end_page="23" type="sub_section"> <SectionTitle> 3.1 Switch Graphs </SectionTitle> <Paragraph position="0"> Switch graphs are based on the observation that a proof structure is DR-acyclic if and only if every cycle contains both edges of a ℘-link. If a cycle contains both edges of a ℘-link, then any switching removes the cycle. Thus if every cycle in a proof structure contains both edges of a ℘-link, every switching is acyclic.</Paragraph> <Paragraph position="1"> The (initial) switch graph of a partial proof structure is defined as the undirected graph underlying the partial proof structure, with edges labeled by sets of ℘-edge identifiers as indicated in Figures 1 and 3. Each edge in a logical ℘-link is labeled with the singleton set containing an identifier for that edge, either Li for the left edge of ℘-link i or Ri for the right edge of ℘-link i. Edges of axiom links and logical ⊗-links are labeled with the empty set.</Paragraph> <Paragraph position="2"> The closure of a switch graph is computed by iterating the following operation: if there is an edge n1−n2 labeled with set X1 and an edge n2−n3 labeled with set X2 such that X1 ∪ X2 does not contain both edges of a ℘-link, add an edge n1−n3 labeled with X1 ∪ X2. An edge n−m labeled by X is subsumed by an edge between the same nodes labeled by Y if Y ⊆ X. The normal switch graph of a partial proof structure is derived by closing its initial switch graph, removing edges that are subsumed by other edges, and restricting to the literal nodes not connected by an axiom link. These normal switch graphs define a unique representation of the combinatorial possibilities of a span of polar trees and their associated links in a partial proof structure. That is, any partial proof substructure that leads to the same normal switch graph may be substituted into any proof net while maintaining well-formedness.</Paragraph> <Paragraph position="3"> The fundamental insight explored in this paper is that two literals may be connected by an axiom link in a partial proof structure without violating DR-acyclicity if and only if they are not connected in the normal switch graph for the partial proof structure. The normal switch graph arising from the addition of an axiom link is easily computed: it is just the closure generated by adding the new axiom link, with the two literals being linked removed.</Paragraph> </Section> <Section position="2" start_page="23" end_page="25" type="sub_section"> <SectionTitle> 3.2 Shift-Reduce Parsing </SectionTitle> <Paragraph position="0"> In this section, we present the search space for a shift-reduce-style parsing algorithm based on switch graphs. The states in the search consist of a global stack of literals, a lexical stack of literals, the remaining tokens to be processed, and the set of links among nodes on the stacks in the switch graph. The shift-reduce search space is characterized by an initial state and state transitions. These are shown in schematic form in Figure 8. The initial state contains the output type's literals and switch graph. A lexical transition moves from a state with an empty lexical stack to one containing the lexical literals of the next token; the lexical entry's switch graph is merged with the current one. A shift transition pops a literal from the lexical stack and pushes it onto the global stack. A reduce transition adds an axiom link between the top of the global stack and the top of the lexical stack if they are complementary and are not connected in the switch graph; the resulting switch graph results from adding the axiom link and normalizing. The stack discipline ensures that all partial proof structures considered are planar.</Paragraph>
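The closure of Section 3.1 and the reduce-time test just described can be written down compactly. The sketch below (Python) uses our own representation — a dict from unordered node pairs to sets of alternative ℘-edge label sets — and is not intended as the paper's implementation.

```python
# Switch-graph closure, subsumption, and the reduce-time linkability test.
# Labels are ('L', i) or ('R', i) for the left/right edge of par link i.
def has_both_edges(labels):
    """True if the label set contains both edges Li and Ri of some par link."""
    return any(('L', i) in labels and ('R', i) in labels for _, i in labels)

def close(edges):
    """Iterate the composition rule to a fixed point, pruning subsumed labels."""
    edges = {pair: set(labels) for pair, labels in edges.items()}
    changed = True
    while changed:
        changed = False
        for p1, labels1 in list(edges.items()):
            for p2, labels2 in list(edges.items()):
                shared = p1 & p2
                if p1 == p2 or len(shared) != 1:
                    continue
                (n1,), (n3,) = p1 - shared, p2 - shared
                if n1 == n3:
                    continue
                for x1 in labels1:
                    for x2 in labels2:
                        new = x1 | x2
                        if has_both_edges(new):
                            continue            # every switching cuts this path
                        pair = frozenset({n1, n3})
                        current = edges.setdefault(pair, set())
                        if any(old <= new for old in current):
                            continue            # subsumed by an existing edge
                        current.difference_update({o for o in current if new <= o})
                        current.add(new)
                        changed = True
    return edges

def may_link(edges, lit1, lit2):
    """Complementary literals may be axiom-linked iff they are not connected
    in the normal switch graph (complementarity is checked elsewhere)."""
    return frozenset({lit1, lit2}) not in edges

# Example: the two edges of a single par link never compose, so the daughters
# of an output division's par link remain linkable.
g = close({frozenset({'root', 'Ao'}): {frozenset({('L', 1)})},
           frozenset({'root', 'Bi'}): {frozenset({('R', 1)})}})
print(may_link(g, 'Ao', 'Bi'))   # True
```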
<Paragraph position="1"> Figure 10 displays as rows the shift-reduce search states corresponding to the two valid proof nets shown in Figure 9. The subscripts on syntactic types in the diagrams are there only so that the types can be indexed in the rows of the table describing the search states.</Paragraph> <Paragraph position="4"> The initial state in both searches is created from the output type's literal. The third column of the diagrams indicates the token consumed at each lexical entry. The switch graphs are shown for the rows in which they are active. Because there are no ℘-links, all sets of edges are empty. The fifth column shows the axiom linking made at each reduce step. The history of these decisions and lexical insertion choices determines the final proof net. Finally, the sixth column shows the operation used to derive the result. Note that reduction is from the top of the lexical stack to the top of the global stack and is only allowed if the nodes to be linked are not connected in the switch graph. This is why N1 cannot reduce with N2 in the second diagram in Figure 10; the second shift is mandatory at this point. Note that as active nodes are removed, as in the first diagram's reduction step linking 0=2, the switch graph contracts to just the unlinked nodes. After the reduction, only N2 is unlinked, so there can be no switch graph links. The link between nodes 4 and 5 is similarly removed almost as soon as it is introduced, in the second reduction step. In the second diagram, the switch graph links persist as lexical literals are pushed onto the stack.</Paragraph> <Paragraph position="5"> Shift-reduce parses stand in one-to-one correspondence with proof nets. The shift and reduce operations may be read directly from a proof net by working left to right through the literals. Between literals, the horizontal axiom links represent literals on the stack. Literals in the current lexical syntactic type represent the lexical stack. Literals that are shifted to the global stack eventually reduce by axiom linking with a literal to the right; literals that are reduced from the lexical stack axiom-link to their left with a literal on the global stack.</Paragraph> </Section> <Section position="3" start_page="25" end_page="26" type="sub_section"> <SectionTitle> 3.3 Memoized Parsing </SectionTitle> <Paragraph position="0"> Using switch graphs, we reduce associative Lambek calculus parsing to an infinite binary phrase-structure grammar, where the non-terminals are normalized switch graphs. The phrase structure schemes are shown in Steedman notation in Figure 11. Lexical entries for syntactic type A are derived from the input polar tree rooted at A•. This polar tree yields a switch graph, which is always a valid lexical entry in the phrase structure grammar.</Paragraph> <Paragraph position="1"> Any result of axiom linking adjacent complementary pairs of literals in the polar tree that maintains switch-graph acyclicity is also permitted. For instance, allowing empty left-hand sides of sequents, the input type (A/(B/B))• would produce the literals A•1 B◦2 B•3 with links 1-2:{L3} and 1-3:{R3}. This could be reduced by taking the axiom link 2=3, to produce the single-node switch graph A•1. In contrast, ((B/B)/A)• produces the switch graph B•1 B◦2 A◦3 with links 1-2, 1-3, and 2-3. Thus the complementary B literals may not be linked.</Paragraph> <Paragraph position="2"> Given a pair of normal switch graphs, the binary rule scheme provides a finite set of derived switch graphs. One or more complementary literals may be axiom linked in a nested fashion at the borders of both switch graphs. These sequences, and their positions relative to the other literals in each switch graph, are indicated in Figure 11. Unlinked combinations are not necessary because the graph must eventually be connected. This scheme is non-deterministic in the choice of the linked sequence. For instance, an adverb input ((N1\S2)/(N4\S3))• produces the literals N◦1 S•2 S◦3 N•4 and connections 1-2, 1-3:{L4}, 1-4:{R4}, 2-3:{L4}, and 2-4:{R4}.
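Written out as data in the representation of the earlier closure sketch, the adverb entry and the constraints it imposes look as follows (Python; the polarity annotations 'out'/'in' follow our reading of the unfolding and are not marked in the original table).

```python
# The adverb entry just described, as a normal switch graph over its literals.
adverb_literals = {1: ('N', 'out'), 2: ('S', 'in'), 3: ('S', 'out'), 4: ('N', 'in')}
adverb_edges = {
    frozenset({1, 2}): {frozenset()},
    frozenset({1, 3}): {frozenset({('L', 4)})},
    frozenset({1, 4}): {frozenset({('R', 4)})},
    frozenset({2, 3}): {frozenset({('L', 4)})},
    frozenset({2, 4}): {frozenset({('R', 4)})},
}

def complementary(lits, i, j):
    return lits[i][0] == lits[j][0] and lits[i][1] != lits[j][1]

def connected(edges, i, j):
    return frozenset({i, j}) in edges

# Within the entry itself no axiom link is possible: the two complementary
# pairs (1,4) and (2,3) are both connected in the normal switch graph.
for i, j in [(1, 4), (2, 3)]:
    assert complementary(adverb_literals, i, j) and connected(adverb_edges, i, j)
# Literals 3 and 4 are not connected at all: an edge would need both L4 and R4.
assert not connected(adverb_edges, 3, 4)
```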
When this adverb entry combines with a verb phrase input (N5\S6)•, with literals N◦5 S•6 and connection 5-6, either the nominals alone may be linked (4=5), or the nominals and the sentential literals may be linked (4=5, 3=6). The result of the single linking is N◦1 S•2 S◦3 S•6 with connections 1-2, 1-3:{L4}, 1-6:{R4}, 2-3:{L4}, and 2-6:{R4}. The result of the double linking is simply N◦1 S•2 with connection 1-2, or in other words, a verb phrase.</Paragraph> <Paragraph position="3"> The dynamic programming equality condition is that two analyses are considered equal if they lead to the same normalized switch graphs. This equality is only considered up to the renaming of nodes and edges. Backpointers to derivations allow semantic readings to be packed in the form of lexical choices and axiom linkings. For instance, consider the two parses in Figure 12.</Paragraph> <Paragraph position="4"> With a finite set of lexical entries, bottom-up memoized parsing schemes will terminate. We illustrate two derivations of a simple subject-verb-object construction in Figure 13. This is a so-called spurious ambiguity because the two derivations produce the same semantic term. They are not spurious globally, because the alternative linkings are required for adverbial modification and object relativization respectively. The ambiguity in the phrase structure grammar results from the associativity of the combination of axiom linkings. The two derivations do not propagate their ambiguity under the dynamic programming scheme precisely because they produce equivalent results. Nevertheless, a worthwhile optimization is to restrict the structure of combinations of linkings in the phrase-structure schemes to correspond to an unambiguous left-most linking strategy; this corresponds to the way in which other associative operators are parsed in programming languages.</Paragraph> <Paragraph position="7"> For instance, x+y+z will be parsed as x+(y+z) if + is defined to be right associative.</Paragraph> <Paragraph position="8"> An unambiguous right-associative context-free grammar can be given for linkings M over literals A and their complements Ā.</Paragraph> <Paragraph position="10"> An example of packing for subject/object scope ambiguities is shown in Figure 14. The derivations in Figure 14 produce different semantic interpretations: one of these is the subject-wide-scope reading and the other the object-wide-scope reading. Unsurprisingly, the memoizing parser does not solve P = NP in the affirmative (Pentus 2003). The size of the switch graphs on the intermediate structures is not bounded, nor is the number of alternative switch-paths between literals. It remains an open question whether the switch graph parser could be bounded for a fixed lexicon (Pentus 1997).</Paragraph> </Section> <Section position="4" start_page="26" end_page="26" type="sub_section"> <SectionTitle> 3.4 Empty Antecedents and Subtending </SectionTitle> <Paragraph position="0"> Lambek's calculus required the antecedent Γ in a sequent Γ ⇒ α : A to be non-empty.</Paragraph> <Paragraph position="1"> Proof nets derive the theorems (⇒ CN/CN) and ((CN/CN)/(CN/CN) ⇒ CN/CN), as shown in Figure 15. These derivations both allow the construction of an output, namely the identity term λx.x and the modifier syntactic type CN/CN, out of no input.</Paragraph> <Paragraph position="2"> A literal A is said to subtend a complementary literal Ā if they are the leftmost and rightmost descendants of a ℘-link.
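A sketch of how subtending pairs can be read off a polar tree follows (Python); the tree encoding and the daughter order in the example are our own reading of Figure 15.

```python
# Leaves are ('lit', atom, polarity); internal nodes are ('par', left, right)
# or ('tensor', left, right), recording the kind of logical link.
def leftmost(tree):
    return tree if tree[0] == 'lit' else leftmost(tree[1])

def rightmost(tree):
    return tree if tree[0] == 'lit' else rightmost(tree[2])

def subtending_pairs(tree, acc=None):
    """Collect (leftmost, rightmost) literal pairs of every par link; such
    complementary pairs are never axiom-linked if empty antecedents are to
    be excluded, as in Lambek's original calculus."""
    if acc is None:
        acc = []
    if tree[0] == 'lit':
        return acc
    if tree[0] == 'par':
        l, r = leftmost(tree), rightmost(tree)
        if l[1] == r[1] and l[2] != r[2]:          # complementary literals
            acc.append((l, r))
    subtending_pairs(tree[1], acc)
    subtending_pairs(tree[2], acc)
    return acc

# The output adjective CN/CN of Figure 15 unfolds to a par link over a
# complementary pair of CN literals (daughter order as we read the figure).
adjective = ('par', ('lit', 'CN', 'out'), ('lit', 'CN', 'in'))
print(subtending_pairs(adjective))   # the output/input CN pair is subtending
```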
In both of the examples in Figure 15, the output adjective (CN/CN)◦ unfolds to a pair of complementary literals in which the input CN• subtends the output CN◦. If literals that stand in a subtending relation are never linked, the set of theorems is restricted to those derivable in Lambek's original calculus.</Paragraph> <Paragraph position="3"> Consider the proof net in Figure 16. An analysis in which S◦8 is linked to S•11 and N•9 is linked to N◦10 is not ruled out by Danos-Regnier acyclicity. It is ruled out by the subtending condition, because S◦8 subtends S•11, these being the leftmost and rightmost daughters of the ℘-node ((N10\S11)\(N9\S8))◦. Note further that there are no cycles violating DR-acyclicity; each of the sixteen switchings is acyclic.</Paragraph> </Section> </Section> </Paper>