File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/90/c90-3035_metho.xml

Size: 20,780 bytes

Last Modified: 2025-10-06 14:12:29

<?xml version="1.0" standalone="yes"?>
<Paper uid="C90-3035">
  <Title>Expressive Power of Grammatical Formalisms</Title>
  <Section position="4" start_page="195" end_page="195" type="metho">
    <SectionTitle>
I
2. The class of WFF of well-formed formulas of
</SectionTitle>
    <Paragraph position="0"> Example: FOL is more expressive than Horn Clauses, but the latter are easily computable (no function symbols in both cases).</Paragraph>
    <Paragraph position="1"> 3. Axioms of the theory; Example. Two theories expressed in the same language, such as L = (+ ,*,=, &lt; ,1), can have completely different properties. For instance the axiomatization of real numbers in L gives a decidable theory, and hence any formula written in L is either provable or disprovable from the axioms, But the axiomatization of natural numbers in the same language does not decide all formulas, i.e.</Paragraph>
    <Paragraph position="2"> there are formulas written in L which cannot be proved or disproved from the axioms.</Paragraph>
    <Paragraph position="3"> 4. Rules of inference; Example. With Modus Ponens A, A---,B // B one can prove more than just with the rule A I/ AvB.</Paragraph>
    <Paragraph position="4"> We shall deal mostly with 1,2 and 4, and allow 3 to correspond to a translation into a logical language of the context free part of a grammar. As we have already mentioned, the difference between TGs and MGs has something to do with 4.</Paragraph>
    <Section position="1" start_page="195" end_page="195" type="sub_section">
      <SectionTitle>
2.2 Context Free Theories
</SectionTitle>
      <Paragraph position="0"> Using granamars we can talk about which notions cml be defined, or expressed, thereby only in an intuitive sense. The reason for that lies in the fact that '%eing defined/definable&amp;quot; is a property of a predicate, therefore a proper language for studying expressive power of different grammars, and grammatical formalisms is logic. Then we can talk about non-defibility or definability of notions such as Passive(x) or Passive of(x,y) formally, in a logical system. It turns out that definability of such a concept depends on a formal language in which a grammar is written. Thus it may happen that two grammars prove &amp;quot;sentencehood&amp;quot; of the exactly the same classes of strings, but it is possible to define such a predicate in one of those theories and not in the other.</Paragraph>
      <Paragraph position="1"> We begin with CFGs. Since we are going to be concerned with definability, we first translate CFGs into CFTs (Context Free Theories). The translation works as follows. A CF production like</Paragraph>
      <Paragraph position="3"> that all the variables are different); H ~ a.G and H --* a.b are replaced by tl(a.x) ~ G(x) and H ( a. b ) , respectively.</Paragraph>
      <Paragraph position="4"> For the sake of the uniformity of notation we will represent a rule of the form H,--B~&amp;B= as \[H, \[Bi, B=\]\]. (The the rule of substitution will correspond to the resolution).</Paragraph>
      <Paragraph position="5"> Proposition translation.</Paragraph>
      <Paragraph position="6"> proves S(s).</Paragraph>
      <Paragraph position="7"> I. Let G be a CFG and G' its A string s belongs to L(G) iff G'</Paragraph>
    </Section>
    <Section position="2" start_page="195" end_page="195" type="sub_section">
      <SectionTitle>
2.3 The Undefinability of Agreement
</SectionTitle>
      <Paragraph position="0"> We turn to a simple example, namely, agreement between NPs and VPs. Consider a sample CFT:</Paragraph>
      <Paragraph position="2"> In order to prove that agreement cannot be captured here, we need to specify what that would mean. It is easy to show that well-defined relations such as numberagree(number,x,y) or agree(number, x,y) are only definable in CFT if the defining formula.uses disjunction. But traditionally disjunction in grammatical description is a standard notation for two (or more) unrelated phenomena.</Paragraph>
      <Paragraph position="3"> Thus, disjunction is not forbidden, but when it occurs, it impfies the factual claim about the referents of the disjuncts are distinct linguistic phenomena. In the case before us, that would be saying that singular and plural agreement are not the same phenomenon. It is not, of course, the business of logic to inquire into whether in fact number agreement in some language is a unitary phenomenon. It is rather the business of' logic to provide the tools for the linguist scientist who, on whatever basis, makes such determinations, to capture formally the theories that he develops, Accordingly, we first assume a special notion of definability, defined as follows: Definition. A relation p is &amp;-definable in a CFT theory T if there is a formula  (*) /'(x,, ... ,x,,) ,-- &amp; ... &amp; &amp;.</Paragraph>
      <Paragraph position="4"> B(x, x i .... ) &amp; ...</Paragraph>
      <Paragraph position="5">  s.t. the tuples (a, .... , a~), which can be proven from T+ (*) to satisfy P(...), are exactly those belonging to the relation p.</Paragraph>
      <Paragraph position="6"> A category c is &amp;-definable in a CFT theory T if the one argument relation corresponing to c is. ltowever, some categories ttms defined are spurious in that they cannot be used in proving sentencehood. We want to rule these out.</Paragraph>
      <Paragraph position="7"> I.et Lnf(G) be the language that contains only those categories which appear in the formulas \[S,\[...\]\] which are derivable in. (the CFT con'esponding to) the grammar. From now on, by (&amp;-)definability we shall understated the (&amp;-)definability in L~G). Moreover, the notion of 'category' will be analogously restricted. Tiffs allows us to avoid spurious categories as in 2.4 below.</Paragraph>
      <Paragraph position="8"> We wil! also refer to constructions which are I.wo~place relations between a grammatical category (the category the construction yields) and a string ,ff grmnmatical categories (which the construction i:; made up of).</Paragraph>
    </Section>
    <Section position="3" start_page="195" end_page="195" type="sub_section">
      <SectionTitle>
2.4 Spurious categories
</SectionTitle>
      <Paragraph position="0"> It is possible to introduce a symbol that corresponds to the category NP, but it c,'mnot be used in deriving sentences from the stm't symbol.</Paragraph>
      <Paragraph position="1"> Our definition allows such spurious category symbols to appear in formulas of CFT, but at the same time it prevents them from having ~ly iHtluence on what categories are definable in the formalism. This simple example clearly shows that an appeal to intuitions would be insufficent to talk about expressive power of the two grammars.</Paragraph>
    </Section>
    <Section position="4" start_page="195" end_page="195" type="sub_section">
      <SectionTitle>
2.5 Some Results
</SectionTitle>
      <Paragraph position="0"> Theorem 2. Ira CFT T contains S(x.y) ~- NI'I - atl(x) &amp; VPt - ah(y) S(x.y) ~ NP2 - att(x) &amp; VP2 -- at,(y) and both NP~-.~ at,(x) and VP, - ale(x) satisfiable for both i. Then the relation agreement agree(aq,x,y) is not &amp;-definable. are of Proposition 3. The relation agree(number,x,y) is not &amp;-definable in the above CFT.</Paragraph>
      <Paragraph position="1"> Theorem 3. The category NP is not &amp;-definable or  definable tbr this language in CFT. Notice that linguists often do allow categories such as NP, but not constructions (such as the subject-predicate construction or passive) or features of constructions (such as agreement) to be described disjunctively. This is especially true of lexical categories. It is thus instructive that, as shown, the phenomenon of agreement means that certain categories are not definable at all, even with disjunction.</Paragraph>
      <Paragraph position="2">  3. Attributes and Constructions  We will talk now about a logic corresponding to CFGs with attributes. We will show that such logics provide an inherently more expressive theory of categories, ahd in particular allow us to define the category NP and VP in a language with number or gender agreement.</Paragraph>
      <Paragraph position="3"> However, the use of attributes does not lead to an all-powerful theory of constructions, ,and consequently certain linguistic generalizations are missed by grammars with attributes. This leads to the introduction of more powerful devices. All the formal languages we consider, if not first order, can be formalized as weak second order system in an obvious way.</Paragraph>
    </Section>
    <Section position="5" start_page="195" end_page="195" type="sub_section">
      <SectionTitle>
3.1 Word Order and Selection Variation
</SectionTitle>
      <Paragraph position="0"> Attribute theories clearly cannot treat as a single construction two forms with different word order.</Paragraph>
      <Paragraph position="1"> That is, they cannot &amp;-define a relation R(Cat,x), where Cat is some grammatical category and x ranges over a set of strings of categories identical except with respect to word order. This defect can be remedied by allowing an ID/LI' tbrmat for rules, which we formalize in a very similar way.</Paragraph>
      <Paragraph position="2"> (Details omitted.) Iiowever, this formalism if the variants with different orders do not differ in any other way. What is more interesting is what happens with ex;unples in which the choice of  attributes for some element is correlated with a choice of word order. For example, English has a slightly different class of verbs in &amp;quot;inverted&amp;quot; sentences than others, e.g. Aren't I smart? vs. *I aren't smart. There is no way to connect V \] with the V&lt; NP order, as opposed to + #wert V - invert\] and the NP &lt; V order. It was precisely l to handle cases like this that metarules were introduced in GPSG model of grammar, and it is one of the reasons for transformations as well.</Paragraph>
      <Paragraph position="3"> By parmnetrizing word order, we can capture the word order phenomena like those in English inverted sentences. Thus, we could have a grammar with a parametrized concatenation operator cone with values, such that 0 eonc(a,b,c)--_ abc, whereas 1 conc(a,b,c) =bac, for example. We can now state a single rule of the form conc(NP,..\[ AnvUX\], VP) ....... j to handle the subject-aux inversion facts.</Paragraph>
      <Paragraph position="4"> requires a formal language with something like the Kleene star, and require more space than we have in this paper. The idea is roughly this: We have introduced a set of sublanguages of Lnf(G) to avoid spurious categories. Now, we make one more restriction: let LnJ(NP) be the language that contains only those categories except NP which appear in the RHS of the formulas \[NP, \[ RIfS \]\] wtfich are derivable in (the CFT corresponding to) the grammar. Similarly, for other categories/symbols. The (&amp;-)definability of a construction X are defined as for other relations, except that the defining formula must belong to Z.n/(ag.</Paragraph>
      <Paragraph position="5"> Theorem 4. Constructions with two variants which differ by the order or number of constituents are not &amp;-definable in attribute grammars.</Paragraph>
      <Paragraph position="6"> Theorem 5. Constructions with two variants which differ by the order or number of constituents together with a difference in some other element are not &amp;-definable in attribute grammars with ID/LP and parentheses.</Paragraph>
      <Paragraph position="7"> Attribute theories also cannot handle variation in selection, i.e., the arity of a construction. This is easily remedied by formalizing the parenthesis notation of BNF which is often used to abbreviate CFG's--when we write, itfformally, a rule like A-,B((~, for example. It is harder, to handle the correlation between some attribute of one element and the presence or absence of some other element. For instance, many analyses of English postulate separate constructions of the VP depending on the class of the verb, e.g., transitive (V NP), ditransitive (V NP NP), transitive- prepositional (V NP PP), and so on. It has also been observed that each of these corresponds to a passive form in which one NP is missing (although a PP of the form by NP is optionally possible instead, this is irrelevant for our purpose). Again, if it were just the presence or absence of the object NP that distinguished the two voices, we could use the parenthesis device. However, the form of the verb also changes from active (e.g. sees) to passive (e.g. is seen). Such phenomena, which can be handled with metarules or transformations, also cannot be handled with attribute grammars.</Paragraph>
      <Paragraph position="8"> The problem of the definability of constructions is a more complicated one. The results below have been obtained for formulas of the standard first order language. A correct account of recursion Now, consider an extension of attribute grammars which parametrizes the presence/absence of constituents. Thus, we write rules like, where ont is a parameter controling 'the appearance or absence of an element (i.e., +ont(X) means X appears, -ont(JO that it does not), V VP-~ \[c~ active\] ~ ont(NP) Now we can &amp;-define the different kinds of transitive constructions, by using ont to control whether the object NP is realized (in the active) or null (in the passive).</Paragraph>
      <Paragraph position="9"> Now, the use of this device allows us to &amp;-define the three different kinds of transitive constructions, but not the passive construction, which still requires disiunction (for the same reason that the active requires disjunction). This is exactly the same as with metarules and transformations (as we will see below). The formalism provides no way of making the verb class attributes (trans, ditrans, trans prep) agree with the number and kind of constituent to the right of the verb (NP vs. NP NP vs. NP PP). Some linguists don't mind this, but we will show below how that can also be done (what is required is a way of making the verb class attribute agree with the number and kind of constituent following the verb). In order to &amp;-define passive, we would need a more powerful 198 4 kind of' parameter, which can control the number and kind of constituents, which we call sel VP-~ X sel And combining the two (ont and sel), we can describe both the transitive and the passive by a</Paragraph>
      <Paragraph position="11"> However, historically such devices as cone, ont, and sel have been unavailable, and instead, transformations and metarules have been used to obtain essentially the same effect, llence, we proceed to show how the power of these models can be represented 4. Derivability, WGs and M-grammars \[S(x.y), \[NP(x), VP(y)\]\] \[NP(u.w), \[ADJ(u), NP(w)\]\] \[S(u.w.y), \[\[NP(u.w), \[ADJ(u), NP(w)\]\], VP(y)\]\]. Clearly, with this rule we can prove about a string that it is an S if it is generated by the corresponding context free grammar. In a natural way we can extend this definition of an inference rule to cover attribute grammars: attributes can be treated simply as constraints. A metarule in an M-grammar such as</Paragraph>
      <Paragraph position="13"> which relates passive and active, can be treated as an inference rule.</Paragraph>
      <Paragraph position="14"> \[VP(xy), \[X(x), NPO')\]\] \[VP(x.z)(PAs), IX(x), m,(t,y.z)\]\] A transformation in a TG can be understood exactly the s,-une way, as a rule Treel//7'ree2. We will consider very simple kinds of TGs and MGs, which operate on (sub)trees of depth one. This is enough to capture GPSG use of metarules, but not the fifll power of conventional TG. The more general model will be discussed briefly, but for our purposes it is more convenient sometimes to consider special cases, which make the demonstrations simpler.</Paragraph>
      <Paragraph position="15"> The TGs and gr,'unmars with metarules (M-Gs) deal not only with strings but a!so with trees. To compare them we have to use a common formalism. Let q&amp;quot; be a collection of trees (over some alphabet with terminals, non-terminals, and perhaps other symbols), where each tree is a pair \[Node, Sons\], where Sore is a list of trees. &amp;quot;Iqaen each rule of a context free grammar can be represerrted as a tree of depth one (the definition of depth being obvious), e.g.</Paragraph>
      <Paragraph position="16"> \[S(x.y), \[NP(x), VP(v)\]\], or \[N(dog), \[(p\]\]. We need now to establish the following interpretation: * Trees, as defined above, will be intepreted as formulas; * The rules of proof will be expressed in the Gentzen style: Tree l,Tree2//Tree3 ; * One of the rules of proof will be substitution, as ill The difference between TG and grammars with metarules (M-Gs) can be now expressed in the definition of a proof.</Paragraph>
      <Paragraph position="17"> * In the case of TGs a proof of a formula I~ is a sequence (P, Q) where 1. P is a sequence of formulas f\] ..... lZk such that each F~ is a fommla corresponding to a context free rule of the grammar or is obtained from \[}, (j&lt; i) by the rule of substitution, and l;k is \[S(...), \[RIlS\]\], where RIIS contains only the terminal symbols (i.e. Fk represents a fully expanded tree).</Paragraph>
      <Paragraph position="18"> 2. Q is a sequence Fk, .,,1~, where each formula is obtained from the previous one by a rule corresponding to a transformation.</Paragraph>
      <Paragraph position="19"> * For M-Gs a proof of a formula I'~, is a sequence (PI, Q, P) where  1. PI is a collection of formulas of depth one; 2. Q -- formulas/trees obtained by applying metarules; 3. P -- formulas/trees obtained by applying  the rule of substitution.</Paragraph>
      <Paragraph position="20"> These definitions allow us to show that a variety of constructions not allowable by CFG's or attribute grammars are definable by MG's and TG's. Also, we obtain a rather neat characterization of' the similarities and differences between TG's ,and MG's.</Paragraph>
    </Section>
  </Section>
  <Section position="5" start_page="195" end_page="195" type="metho">
    <SectionTitle>
5 199
5. Other Notions of Definability
</SectionTitle>
    <Paragraph position="0"> The notion of &amp;-definability of categories and constructions is not the only one that we could have employed. One alternative would be to look at what relations are definable. For instance, Sells (p.93) remarks that it is impossible to express Subject-Subject- Raising as a metarule. From our perspective this of course still holds true, but we can also see that one can easily define a relation SSR(t,,t2) which holds only if the first tree has been obtained by subject raising from the ~econd tree:</Paragraph>
    <Paragraph position="2"> (&amp;quot;seem&amp;quot; stands for all verbs of this class).</Paragraph>
    <Paragraph position="3"> Another extension would be to consider definability of categories, constructions, or relations not for a single grammar but across an (itffmite) class of grammars. For example, we have seen that in a language of CFT a definition of agreement can be given only by a disjunctive formula, but it can be given, so long as we confine ourselves to a single grammar.</Paragraph>
    <Paragraph position="4"> However, in linguistics there is precedent for (informal) arguments that some notion, while definable for each grammar of a class, cannot be defined for the whole class. Ttfis idea leads to a new notion of definability: Definition. A property Pr is definable over/across a class of theories C , if there is a formula F(x) s.t. for any T in C, and any term t of the language, we have Pr(t) iff T+ F(x) proves P(t).</Paragraph>
    <Paragraph position="5"> Similarly, one defines &amp;-definabality across C. ( Pr(t) means that it holds in any model of 7).</Paragraph>
    <Paragraph position="6"> Let's concentrate on the agreement wrt gender.</Paragraph>
    <Paragraph position="7"> Notice that the assumption that agreement exists and that it should be somehow expressible in a grammar (using disjunctions or not) is an empirical statement. We could express it formally by augmenting our language with a (higher order) device detecting the presence of a substring 'fern' in a predicate as in S(x.y) ~ NP - sing -fern(x) &amp; VP - sing -fern(y). We will be interested in a subclass of CFT that allow formulas of the sort: S(x.y) ~ NP - at! - a h - ...at N-fern(x) &amp; VP- at 1 - ...at N - fem(y) Assuming that we can talk about agreement formally, we can formulate and prove the following fact.</Paragraph>
    <Paragraph position="8"> Theorem 6. Agreement with respect to an attribute is not definable across the class CFT of context free theories.</Paragraph>
    <Paragraph position="9"> We now see that much stronger results can be achieved across languages than for one language. For a single CFT, we can only that agreement is not &amp;-definable, but for the CFT's as a class we have just seen that it is not defmable at all.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML