File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/91/j91-3002_metho.xml
Size: 62,740 bytes
Last Modified: 2025-10-06 14:12:41
<?xml version="1.0" standalone="yes"?> <Paper uid="J91-3002"> <Title>Chinese Number-Names, Tree Adjoining Languages, and Mild Context-Sensitivity</Title> <Section position="3" start_page="278" end_page="279" type="metho"> <SectionTitle> 2 Incidentally, Merriarn-Webster's Third New International Dictionary of the English Language (in the </SectionTitle> <Paragraph position="0"> Fifteenth Edition of Encyclopedia Britannica) has a table on p. 1549 labeled &quot;DENOMINATIONS ABOVE ONE MILLION.&quot; This table contains entries up to 'centillion,' which happens to be 103o3 in the American system and 1036 in the British one. The number 1036 is expressed in the American system as 'undecillion.' Computational Linguistics Volume 17, Number 3 matter of performance could possibly utter enough trillions in sequence to make a counting expression expressing ten to the trillionth power, but no grammatical theory need concern itself with this fact.</Paragraph> <Paragraph position="1"> While we fully agree with Hurford on this matter, the contrived nature of &quot;strungtogether&quot; trillions or thousands or any other single-word number-name seems to lead some investigators into rejecting its empirical validity in English. For example, Brainerd (1971, p. 208) mentions the following: The collection of numerical expressions in most languages, as in English, are basically finite. Thus in English we must ultimately coin new 'illions' if we are to transcend our \[finite\] system of number names. And where are these to come from when we have run out of Latin prototypes? 3 In some languages there are, however, purely linguistic devices which allow for an infinitude of numerical expressions. For example, in Chinese, wan is used for 104 and wan wan for 108 . Presumably we can continue ad infinitum, wan wan wan 1012, wan wan wan wan 1016 , etc.</Paragraph> <Paragraph position="2"> Thus, while the linguistic status of strung-together number-names in English might be questionable, Brainerd suggests on the basis of empirical evidence that this is not so in Chinese. Characteristic (i) appears then to be linguistically warranted in at least one NL, currently the most widely natively spoken one. 4 In the next section, we take a look at some more data on number-names in Chinese. This will help us to support our conclusion regarding the linguistic reality of characteristic (i) as well as to determine the empirical status of characteristic (ii). Furthermore, the data will serve as a basis for the formal arguments to be presented in Sections 4 and 5. In Section 6, we shall return to discuss the linguistic status of characteristic (ii).</Paragraph> </Section> <Section position="4" start_page="279" end_page="279" type="metho"> <SectionTitle> 3. Data 5 </SectionTitle> <Paragraph position="0"> The number-name in Chinese for '10,000' is wan. The number-name for '100,000,000' is either wan wan or yi. 6 Contrary to Brainerd's (1971) presumption, some native speakers find longer number-names such as wan wan wan for 1012 or wan wan wan wan for 1016 awkward due to the existence of the numerically higher-valued single-word number-</Paragraph> </Section> <Section position="5" start_page="279" end_page="279" type="metho"> <SectionTitle> 3 Alexis Manaster-Ramer (p.c.) has provided an answer to this question: These are to come from </SectionTitle> <Paragraph position="0"> indefinite iteration. In other words, once we have reached the limit of Latin prototypes, we can still form new number-names by stringing together Latin prototypes word-internaUy. In fact, such a number-word-formation process would be a morphological analogue of (i) at the single-word level.</Paragraph> </Section> <Section position="6" start_page="279" end_page="279" type="metho"> <SectionTitle> 4 We also find the following claims in the literature: (a) Menninger (1969, p. 129) reports: &quot;The Gothic </SectionTitle> <Paragraph position="0"> word for 'hundred'... is not hund but is represented by two full word forms: taihun-taihund, 'a ten of tens,' in which the grouping 'ten' is counted.&quot; Nevertheless, since Gothic is dead, our knowledge of it is limited to a finite nonproductive corpus of items. From this one example, we cannot conclude that the Gothic cardinal number-name system permitted an indefinite amount of concatenations of number names. (b) Brainerd (1967, p. 43) reports: (due to Gr. C. Moisil) &quot;It has been suggested that by using the expressions milioane de milioane, milioane de milioane de milioane, etc. a name can be given for every natural number \[in Rumanian\].&quot; Strung-together number-names in Rumanian sound less awkward due, most likely, to the phonetically realized separator de between the single-words. Yet, while such expressions sound fine, Rumanian has no direct analogue of Chinese's wan wan, in the sense of a commonly-used strung-together number-name. As will be shown in Section 3, wan wan is a perfectly common and natural name for 108 in Chinese.</Paragraph> </Section> <Section position="7" start_page="279" end_page="289" type="metho"> <SectionTitle> 5 For elaborations on Chinese names for cardinal numbers beyond what we present here, see, for </SectionTitle> <Paragraph position="0"> example, Chao (1968, pp. 567-575) or Henne et al. (1977, pp. 239-242). For grammars generating Chinese number-names, see Brainerd (1966a) and Brainerd and Peng (1968). For a historical survey, see Needham (1959, pp. 1-90).</Paragraph> <Paragraph position="1"> 6 When used as the number-name for '100,000,000,' yi is usually pronounced in the fourth tone. If used as the number-name for '1,' it is pronounced in the first tone.</Paragraph> <Section position="1" start_page="280" end_page="282" type="sub_section"> <SectionTitle> Radzinski Chinese Number-Names </SectionTitle> <Paragraph position="0"> name yi. Thus, 1012 is more naturally expressed as wan yi and 1016, as yi yi. Yet there exists a single-word number-name with a higher numerical value than yi. This single-word is zhao, meaning 1012, i.e., Amer. 'trillion.' This number-name appears to be the numerically highest-valued single-word number-name in use in the modern language. 7 The most natural way of linguistically expressing a number exponentially greater than 'trillion'--as &quot;natural&quot; as one gets with such high numbers--is by stringing together instances of zhao. This is the same tool used for expressing 108 as wan wan, which is, as previously mentioned, a frequently used synonym for yi. Thus, the unbounded stringing-together of instances of zhao is simply an extended instantiation of a method frequently used in Chinese for expressing a more common lower-valued number.</Paragraph> <Paragraph position="1"> Examples (a) and (b) below are well-formed Chinese number-names, while example (c) is not: 8 (a) wu zhao zhao wu zhao five trillion trillion five trillion (i.e. 5,000,000,000,000,005,000,000,000) (b) wu zhao zhao zhao zhao zhao wu zhao zhao five trillion trillion trillion trillion trillion five trillion trillion zhao zhao wu zhao zhao zhao wu zhao zhao trillion trillion five trillion trillion trillion five trillion trillion wu zhao five trillion (c) *wu zhao zhao wu zhao zhao zhao five trillion trillion five trillion trillion trillion Similarly, example (d) is well-formed, while (e) is not. Both of these are examples of number-names containing adjacent 'columns' in which the number of instances of zhao in one of the 'columns' is neither the predecessor nor the successor of the number of instances of zhao in the other column: (d) wu zhao zhao zhao zhao wu zhao zhao five trillion trillion trillion trillion five trillion trillion (e) *wu zhao zhao wu zhao zhao wu zhao zhao five trillion trillion five trillion trillion five trillion trillion zhao zhao trillion trillion The well-formed number-names that we have seen all follow a pattern in which larger clusters of zhao precede, from left to right, smaller clusters of zhao, while the ill-formed number-names do not adhere to such a requirement. All the well-formed number7 Needham (1959, p. 87) (also reproduced in Brainerd (1966a, p. 42)), gives a list of single-word number-names up to 1044. Yet all those beyond zhao are extremely archaic and are most likely not part of the vocabulary of a present-day native speaker.</Paragraph> <Paragraph position="2"> 8 In all of our examples, we omit possible instances of ling. This is a morpheme with numerous meanings--as is not uncommon in Chinese---including 'zero; 'remainder,' and, when used in between other morphemes in a number-name, we could say it serves as a type of conjunction. It is used in case of a gap of more than one decimal order within a number-name. Yet, if there is more than one such gap in a number-name, generally only one ling is used and this is in place of the rightmost gap. Number-names for multiples of &quot;10,' as are all of our examples, generally, though not necessarily, lack such use of ling. In any case, its use would not affect in any significant way the formal argumentation to be presented in Sections 4 and 5. (On the semantic evolution of ling, see Needham (1959, pp. 16-17)). Computational Linguistics Volume 17, Number 3 names composed only of instances of wu and zhao form the set J: J = {wu zhao kl wu zhaok2.., wu zhaoknlkl > k2 > ... > kn > 0}.</Paragraph> <Paragraph position="3"> This clearly lends empirical support to characteristics (i) and (ii) of Section 2. Yet, whether characteristic (ii) is a consequence of a linguistic constraint or of some other type of constraint is a question we leave open till Section 6.</Paragraph> <Paragraph position="4"> 4. Numeric Chinese and Tree Adjoining Languages In light of recent, and not so recent, definitions of languages lying between CFLs and CSLs, we are now capable of demonstrating that it is not only CFGs that fail to generate Numeric Chinese, but even more powerful grammars as well. We shall argue here that Numeric Chinese is not a TAL. The weak equivalence of TAGs, a modified version of Head Grammars and Linear Indexed Grammars has been shown in Vijayashanker (1988), while the weak equivalence of these three formalisms and Combinatory Categorial Grammars has been shown in Weir (1988) and Weir and Joshi (1988).</Paragraph> <Paragraph position="5"> TAGs perform certain manipulative adjunction operations on tree structures. HGs are similar to CFGs except that they allow head-wrapping operations in addition to the usual concatenation operation used in CFGs. LIGs are a restricted type of IGs, which were introduced by Aho (1968) as an extension of CFGs. In addition to terminals and nonterminals, IGs also have indices, or flags, which can be used in rewrite operations. Their use may lead to the generation of non-CFLs. LIGs restrict the way in which indices may be used. Combinatory Categorial Grammars are an extension of classical Categorial Grammars. The latter were introduced into the linguistic literature by Bar-Hillel (1953), based largely on the work of Lesniewski (1929) and Ajdukiewicz (1935) in the context of philosophical logic. The classical Categorial Grammar formalism is characterized by the use of the combinatory operation of functional application, and is extensionally equivalent to the CF formalism. Combinatory Categorial Grammars are augmented with the combinatory operation of functional composition used in a restricted way.</Paragraph> <Paragraph position="6"> In addition to generating all CFLs, these four types of grammars generate non-CFLs such as {anbncn}, {anbncndn}, and {WWIW E (a + b)*}. They exclude, however, languages such as {a&quot;bnc&quot;dne n } and others exhibiting a dependency of more than four columns, and {WWWIW E (a + b)*} and other languages exhibiting more than a single copy. Since these four formalisms have been proved to be weakly equivalent, i.e. they generate the exact same stringsets, we will limit our discussion below to TAGs. We now proceed to prove that NC is not a TAL in the weak sense. We begin by considering the following regular language R: R = {wu zhao + wu zhao + wu zhao + wu zhao + wu zhao + } Strings of R may be described as the concatenation of five columns, each column comprising one instance of wu followed by one or more instances of zhao. Let H --NC N R. Clearly, H = {wu zhao n wu zhao m wu zhaJ wu zhao k wu zhao j In > m > I > k > j > 0} Strings of H may be described as the concatenation of five columns, each column comprising one instance of wu followed by one or more instances of zhao and each</Paragraph> </Section> <Section position="2" start_page="282" end_page="289" type="sub_section"> <SectionTitle> Radzinski Chinese Number-Names </SectionTitle> <Paragraph position="0"> column having more instances of zhao than any column to its right. For notational convenience, we now define the homomorphism h such that:</Paragraph> <Paragraph position="2"> We apply the pumping lemma for TALs and arrive at a contradiction. This pumping lemma is given in Vijayashanker (1988, pp. 96-101) as Theorem 4.7: Pumping Lemma for TALs If L is a TAL, then there is a constant n such that if z E L and Izl > n then z may be written as z = UlVlWlV2uav3w2v4u3 with \[~)lWlV2V3W2V41 ~ n, \]VlV2?dB?d41 ~ 1 such that for all i > 0, UlV~WlVi2u2vi3w2?di4u 3 , L.</Paragraph> <Paragraph position="3"> It follows from this theorem that for any string in a TAL longer than a determined constant for that TAL, the string has at most four pumpable substrings. We now demonstrate that some long strings of L require more than four pumpable sub-strings in order to remain within L after pumping. Assume L is a TAL. Where n is the constant referred to by the lemma, corresponding to our L, consider the string z -= abn+4abn+3abn+2abn+lab n which is in L. Let us now number the columns of z 1-5 from left to right, where a column is an a followed by bs. Since all strings of L contain exactly five instances of a, VlV2vgv4 cannot contain as, else these could be pumped, yielding strings outside of L. Thus, VlV2V3V 4 must consist solely of bs. Since there are at most four pumpable substrings (Vl, v2, v3, and v4) and z comprises five columns, at least one column of z will not contain a pumpable substring. We can now look at the following two exhaustive cases: A. A column c with no pumpable substrings to the left of some column d with at least one pumpable substring.</Paragraph> <Paragraph position="4"> B. A column c with no pumpable substrings to the right of some column d with at least one pumpable substring.</Paragraph> <Paragraph position="5"> Case A. Let c be column 1 and let vl, v2, v3, and v4 be in columns 2, 3, 4 and 5, respectively. Let d be column 2. It suffices to pump up once, i.e. set i = 2, in order for column 1 to cease being longer than column 2, a necessary condition for all strings in L. As long as there is at least one pumpable substring in some column d to the right of c, any other instantiation of VlV2V3V 4 will yield a result in which a column will not be longer than another column to its right when setting i = 2.</Paragraph> <Paragraph position="6"> Case B. Let c be column 5 and let vl, v2, v3, and v 4 be in columns 1, 2, 3, and 4, respectively. Let d be column 4. Here pumping up once or more, i.e. setting i > 1, Computational Linguistics Volume 17, Number 3 will not suffice, because the newly created strings remain in L, since they meet the well-formedness conditions of L: Each column contains more instances of b than any one of the columns to its right. What we need in this case is to resort to pumping down, i.e. setting i = 0. In such a case, column 5 will cease being shorter than column 4, a necessary well-formedness condition for all strings in L. As long as there is at least one pumpable substring in some column d to the left of c, any other instantiation of VlV2VgV 4 will yield a result in which a column will not be longer than another column to its right when i = 0.</Paragraph> <Paragraph position="7"> These two cases exhaust all possible pumpings of non-fixed positions. Both lead to contradictions. Hence, the assumption that L is a TAL cannot be true. 9 *</Paragraph> <Paragraph position="9"> (cf. Theorem 3.6 in Vijayashanker (1988 pp. 76-77)) and under arbitrary homomorphism (cf. Corollary 4.1 in ibid., p. 94). By Lemma 1, L is not a TAL. Therefore, neither is NC. * 5. NC, ILs, MCTALs and Mildly CSLs After demonstrating in the previous section that NC is not a TAL, we are now faced with the following question: What type of non-TAL is it? Under the assumption that the only nonfinite phenomena in NC are strung-together number-names (of the type we have investigated), we would conjecture it is included in the class of Indexed Languages. The following IG(H) generates H, NC's proper subset responsible in our argument for NC's being a non-TAL: (our notation is a minor variant of the one used by Hopcroft and Ullman 1979, pp. 389-390).</Paragraph> <Paragraph position="10"> IG(H)=({S,T,U,V,W,X,Z}, {wu, zhao},{f,g}, P, S) where P comprises:</Paragraph> <Paragraph position="12"> However, H is not the only proper subset of NC having the characteristic of more than four columns. We could just as well have intersected NC with a different regular language containing more than five sequences of wu zhao +, thus yielding a proper 9 Since our proof has made no use of the condition in Theorem 4.7 requiring the length of VlWlV2V3W2V4 to be less than or equal to the constant n, perhaps even some weaker version of the pumping lemma for Tree Adjoining Languages lacking that condition would suffice for our purposes. In fact, such a version, attributed to Vijayashanker, is mentioned in passing by Weir (1987, p. 5). Furthermore, the pumping lemma for Head Languages, Theorem 45 in Roach (1987, pp. 321-325), lacks a condition of this sort (though added subsequently on p. 336). We have nevertheless opted for Vijayashanker's (1988) Theorem 4.7 because of its succinctness and perspicuity.</Paragraph> <Paragraph position="13"> subset of NC with a number of columns equal to the number of wu zhao + sequences in the intersected regular language. If this number were fixed, then we could simply define more nonterminals and add more productions to IG(H) yielding a language of, say, 6, 275, or 109438 columns, all depending on the fixed number of columns we wish to have. But since the number of columns in our construction is indefinite, such an IG would not suffice. For a proper subset J of NC, with arbitrarily many columns as described in Section 3, we need the following simpler IG(J):</Paragraph> <Paragraph position="15"> Are there known formalisms that generate NC, but not the entire class of ILs (or some non-ILs)? We attempt to answer this question by first considering M(ultiple) C(omponent) TAGs and other similar formalisms. MCTAGs have been discussed in the literature; for example, in Joshi (1987, pp. 110-113), Joshi et al. (1989, pp. 13-15), Vijayashanker et al. (1987, pp. 106-107), and Weir (1987, pp. 30-33; 1988, pp. 31-39).</Paragraph> <Paragraph position="16"> These are grammars whose generative capacity exceeds that of TAGs, since adjunction in MCTAGs is performed simultaneously on sequences of trees rather than on single trees. Pollard (1984, pp. 210-215) has defined a formalism called Generalized CFG.</Paragraph> <Paragraph position="17"> This formalism is an extension of CFG wherein the compositional operations in the grammar's production set need not be limited to functions of concatenation or syncategorematic insertion of terminals. Kasami et al. (1988, pp. 9-11) have shown that Generalized CFGs generate all the recursively enumerable sets. In addition, they have defined a restricted Generalized CFG called M(ultiple) CFG, which is stronger than Head Grammar. (HG is a Generalized CFG whose compositional operations are restricted to those of CFG plus head-wrapping.) Independent of Kasami et al.'s research, Vijayashanker et al. (1987, pp. 108-111) and Weir (1988, pp. 90-110) have introduced a formalism called L(inear) C(ontext-)F(ree) R(ewriting) S(ystem)s, whose definition turns out to be the same as that of MCFG. The composition operations in LCFRSs, or MCFGs, are restricted to being linear and nonerasing. In other words, they lack functions that 'copy,' 'erase,' or 'restructure' unbounded components of their arguments, and as such are 'size' preserving. Weir (1988, pp. 101-106) has shown that the classes of stringlanguages generated by MCTAGs and LCFRSs are the same. Given the equivalence of MCTALs, LCFRLs, and MCFLs, we focus our discussion below on MCTAGs, yet refer to LCFRSs and MCFGs if necessary or convenient.</Paragraph> <Paragraph position="18"> Manaster-Ramer (1987a, p. 233) has pointed out that &quot;\[the MCTAG formalism\] can handle quintuple and higher counting dependencies, but it still cannot deal with ... unbounded counting dependencies.&quot; That is, this formalism generates languages with a fixed number of columns, and not ones with an arbitrary number of them. Thus, while MCTAGs generate H, they do not generate J. Accordingly, Numeric Chinese cannot be accounted for by any MCTAG. We now proceed to prove this claim. We begin by considering the regular language R':</Paragraph> <Paragraph position="20"> Computational Linguistics Volume 17, Number 3 Strings of R' may be described as the concatenation of any arbitrary non-zero number of columns, each column comprising one instance of wu followed by one or more instances of zhao. Let J = NC N R'. Clearly, as presented already in Section 3, J = {wu zhaoklwu zhaok2...wu zhao kn I kl > k2 > ... > kn > 0}.</Paragraph> <Paragraph position="21"> Strings of J may be described as the concatenation of any arbitrary non-zero number of columns, each column comprising one instance of wu followed by one or more instances of zhao, and each column has more instances of zhao than any column to its right. For notational convenience, let K = h(J), where the homomorphism h is as defined previously in Section 4. Clearly,</Paragraph> <Paragraph position="23"> We apply a pumping lemma for MCTALs arriving at a contradiction. A pumping lemma for Multiple CFLs is given in Kasami et al. (1988, pp. 18-20) as Lemma 3.4. In order to achieve maximum analogy and uniformity with Vijayashanker's (1988) pumping lemma for TALs, we rephrase Kasami et al.'s Lemma 3.4, making it applicable to MCTALs: 1deg Pumping Lemma for MCTALs If L is a MCTAL, then there are constants n and m such that if z E L and \]z I > n then z may be written as z = UlVlWlSlU2V2W2S2U 3 ... UmVmWmSmUm+ 1 with ~j=l Ivjsjl > 1 such that for all i >__ 0, i i i i i i m bIlVlWlSlbI2V2W2S21&quot;13 * *, blmldmWmSml,lm+l ~- L. 10 In fact, Kasami et al.'s pumping lemma is weaker than ours. Their Lemma 3.4 (p. 18) is phrased as follows: For any ... mcfl L, if L is an infinite set then there exist some uj E T* \[where T is a finite set of terminals\] (1 <_ j < m + 1) \[for a fixed m associated with L\], vj, wj, sj E T* (1 <_ j G m) which satisfy m the following conditions: (1) ~j=l Ivjsj I > 0, (2) for any non-negative integer i, A i i i i i i</Paragraph> <Paragraph position="25"> There is an essential quantificational difference between Lemma 3.4 and our lemma. Lemma 3.4 claims that every infinite Multiple CFL has some strings, i.e. at least one and maybe more, which satisfy the pumping conditions. Such a lenLma suffices, for example, for proving the non-MCFness of a language with an indefinite amount of columns in which all columns are equal in length, since such a language does not contain a single string with a fixed number of pumpable substrings. \[See Lemma 3.5 in Kasami et al. (1988, p. 20).\] However, such a lemma is not strong enough to show that a language, such as K, with an indefinite amount of columns in which every column is longer than any column to its right, is not an MCFL. For example, the following string z E K, satisfies the pumping conditions of Lemrna 3.4: z = abldegdegdegdegdegdeg(n+2m)abn+2m-labn+2m-2.., abn+2m-(2m-1)ab n If vl is in the leftmost column of z and we only pump vl, as allowed by the lemrna, then we still remain in the language, for all i > 0. Our lernma, on the other hand, claims that the pumping conditions must be satisfied by a~ the strings of a MCTAL longer than a determined constant for that MCTAU With our lemma, it is enough to find even one single string, longer than the constant, that does not satisfy the pumping conditions, in order to prove, as we do below, that the language containing that string is not a Multiple Component Tree Adjoining Language.</Paragraph> <Paragraph position="26"> Interestingly, it is not necessary for us to prove our version of the pumping lemma: Kasami et al.</Paragraph> <Paragraph position="27"> have already done so! Their proof crucially depends on the following assumption (p. 19): &quot;Let us It follows from this lemma that, for any string in a MCTAL that is longer than a determined constant n associated with this MCTAL, the string has at most 2m pumpable substrings, where m is a fixed number associated with the particular MCTAL. We now demonstrate that some long strings of K require more than 2m pumpable substrings in order to remain within K after pumping. Assume K is a MCTAL. Where n and m are the constants for K corresponding to those referred to by the lemma, consider the string z = abn+2mabn+2m-labn+2m-2.., abn+2m-(2m-1)abn which is in K (since the number of columns in strings of K is not necessarily fixed). Let us now number the columns of z 1 thru 2m + 1 from left to right. The pumpable substrings, i.e. vjsj for all j = 1 to m, cannot contain any instances of a, since pumping as would yield new strings with columns longer than columns to their left, columns shorter than columns to their right, columns equal in length to other columns, or two adjacent as. None of these are in K. Thus, the pumpable substrings must consist solely of bs. The remainder of this proof is analogous to that of Lemma 1, mutatis mutandis. *</Paragraph> <Paragraph position="29"> guages and under substitution \[cfo Theorem 3.9 in Kasami et al. (1988, p. 21)\]. Homomorphism is simply a special case of substitution. MCFLs = MCTALs. By Lemma 2, K is not a MCTAL. Therefore, NC is not a MCTAL. * We may now begin to answer the question posed at the outset of this section: NC is not a MCTAL, but still appears to be an IL. 11 We fail, however, to find a well-studied and attractive formalism that would seem to generate NC without generating the entire class of ILs (or some non-ILs)22 Now, we are faced with the following question: if NC consider a derivation tree t of z E L such that Iz\] ~ q INf+l.&quot; A consequence of this assumption is that z satisfies the pumping conditions, hence there is at least one such string in any infinite Multiple Context-Free Language. Yet, their z was chosen arbitrarily from the set of strings in L longer than a determined constant for L (in their case qINt+l). Hence, all strings longer than that constant satisfy the pumping conditions. Thus, in fact, Kasami et al. proved implicitly something stronger than what they had claimed to prove.</Paragraph> <Paragraph position="30"> 11 Kac (1987, p. 451) mentions: &quot;It nonetheless remains to be seen what sort of increase in power over that possessed by these classes of grammars \[i.e. HG and TAG\] is sufficient to handle respectively sentences.&quot; The issue is raised after noting that certain instances of the English respectively construction map on to the five-column language {bhanbmdd k I 2 ~ h ~ n ~ m ~ 1 ~ k} and to additional multicolumn languages without limit on the number of columns they contain. An indirect consequence of our last conclusion is that, given the close resemblance between Kac's five-column language and our L, as well as, of course, between any of his other multicolumn languages and our corresponding subsets of K, we can now suggest that the increase in power sought will have to be of the type that allows for the generation of non-MCTALs without allowing for the generation of the entire class of ILs (or some non-ILs).</Paragraph> <Paragraph position="31"> 12 Such a formalism might be of the type similar to an augmented version of Combinatory Categorial Grammar as proposed by Steedman (1985) and discussed in Weir (1988, pp. 128-131) (together with suggested extensions to Linear Indexed Grammars) and Weir and Joshi (1988, p. 284). There do exist, in fact, two formalisms known to the author, which are restricted types of IGs, that generate the phenomena under discussion: a) If we make one simple change to the definition of our IG(J) so that Z is not a member of the set of nonterminals, but rather forms a singleton of intermediates, then what we end up with is an R-grammar. R-grammars are discussed in Aho (1968, p. 670). Only intermediates can consume indices and once an index is consumed, no new indices can be generated by the intermediates. Also, all productions involving an index consumption must be right linear in form (as are such rules in our IG(J)).</Paragraph> <Paragraph position="32"> Nevertheless, neither Aho (1968) nor Aho (1969), wherein these restrictions on IG are also discussed, Computational Linguistics Volume 17, Number 3 is not a MCTAL, then is it Mildly Context-Sensitive? To answer this, we must consider the properties that make a CSL mild. On this matter, Joshi (1985, p. 225) writes: I would like to propose that the three properties 1. limited cross-serial dependencies, 2. constant growth, and 3. polynomial parsing roughly characterize a class of grammars (and associated languages) that are only slightly more powerful than context-free grammars (context-free languages). I will call these mildly context-sensitive grammars (languages).</Paragraph> <Paragraph position="33"> Accordingly, the entire class of MCTALs is Mildly CS, but not the entire class of ILs, which, inter alia, includes languages that are non-constant growth such as a 2&quot; . We now investigate if NC possesses the Mildly CS properties.</Paragraph> <Paragraph position="34"> Polynomial Parsing. Since parsing is dependent on a grammar and we are not dealing here with a specific grammar but rather with a language, we can only consider polynomial recognition. Weir (1988, pp. 98-101) gives the following theorem: Theorem 4.4.1 If L is a language generated by a grammar of some formalism that is a LCFRS, then L can be recognized in polynomial time on a turing machine.</Paragraph> <Paragraph position="35"> By Weir's Theorem 4.4.1, strings in NC generated by a MCTAG are polynomially recognizable. We will now show that this is true for strings in NC not generated by a MCTAG. Such strings are precisely those that contain an indefinite amount of columns of strung-together number-names, and the set of these strings can be mapped to K via simple homomorphisms. The following transitions of a T(wo-) H(ead) F(inite) shows explicitly and unequivocally that the class of languages generated by such restricted IGs is indeed properly included within the class of ILs. In other words, although not intuitively likely, it may still be the case that the stringlanguage classes are equivalent.</Paragraph> <Paragraph position="36"> b) Fischer (1968) discusses Macro-like Grammars. One type of these is the O(utside) I(n) Grammars, which Fischer proves to be weakly equivalent to IGs. A particular restriction on the definition of OI Grammars yields Basic Grammars, whose class of object stringlanguages is properly contained in the class of ILs. This is because Basic Grammars generate only those languages that are generated both by OI Grammars and I(nside) O(ut) Grammars, i.e. their intersection, and there are OI languages (i.e. ILs) that are not IO languages. A restriction on the definition of Basic Grammars yields Linear Basic Grammars. Proper inclusion of the latter in the former is only conjectured. The following rather simple Linear Basic Grammar (and, of course, Basic Grammar) productions generate J (where the symbol 'A' refers to the empty string and the symbol '1' is a disjunction of right-hand sides of rules having a common left-hand side):</Paragraph> <Paragraph position="38"> However, Fischer conjectures that the class of Linear Basic Languages does not include the class of CFLs. A formalism that does not generate all CFLs would, most likely though perhaps arguably, seem unattractive for our purposes. Furthermore, although the class of Basic Languages does include the class of CFLs, both the class of Basic Languages and the class of Linear Basic Languages are not closed under inverse homomorphism and, as such, do not each form a full A(bstract) F(amily of) L(anguages).</Paragraph> <Paragraph position="39"> As Savitch (1989, p. 255) comments, &quot;... in some circles \[being a full AFL\] invests the class \[of languages\] with a certain respectability.&quot; Thus, again, a formalism whose class of object languages does not form a full AFL would, most likely though perhaps subject to counterarguments, seem unattractive for our purposes. The other language classes discussed in this paper each form a full AFL. (For more on Macro-like Grammars, see Fischer (1968).) only left to right. It is started in state q0 with both heads on the tape's leftmost square.</Paragraph> <Paragraph position="40"> Transitions are of the form (qi, w, x, qj) where qi is the current state, w the string to be read by head 1, x the string to be read by head 2, and q/' the state to enter. We use the symbol '&' to refer to the null string, i.e. &quot;do not read,&quot; and '#' as a right-edge marker. An input string is accepted iff one of the heads (or both) falls off the right edge and enters the final state qf.13 Our THFA(K) simultaneously uses its two heads in order to compare the number of bs in every two adjacent columns. It accepts a string iff the string has no column whose length is greater than or equal to the length of a column to its left. THFAs for acceptance of actual NC strings will be analogous to THFA(K).</Paragraph> <Paragraph position="41"> C(onstant) G(rowth). Based on work by Aravind Joshi and by Bob Berwick, Weir (1988, p. 3) presents a definition of CG as foUows: TM L is constant growth if there is a constant c0 and a finite set of constants C such that for all w c L where Iwl > Co there is a w' E L such that \[w I ~ \]w'\[ +c for some cEC. He also gives the following theorem: (ibid., pp. 96-97) Theorem 4.3.1 If L is a language generated by a grammar of some formalism that is a LCFRS, then L is a semilinear language.</Paragraph> <Paragraph position="42"> CG is a consequence of semilinearity. By Weir's Theorem 4.3.1, the set of strings in NC generated by a MCTAG is CG. We show now that the set of strings in NC not &quot; generated by a MCTAG is also CG, according to the Joshi-Berwick definition. Such 13 Our definition of a Two-Head Finite Automaton differs slightly from that given in Lewis and Papadimitriou (1981, pp. 306-307) wherein acceptance requires both heads to simultaneously move off the right end of the tape while entering a designated final state.</Paragraph> <Paragraph position="43"> 14 A similar definition is given in Berwick (1984, p. 198) and in Berwick and Weinberg (1984, p. 113). Computational Linguistics Volume 17, Number 3 strings are precisely those that contain an indefinite amount of columns of strung-together number-names. To these, we can always add one more instance of the strung-together element, whether zhao, yi, wan, or what have you, immediately to the left of the leftmost instance of that element, yielding a wellformed Chinese number-name.</Paragraph> <Paragraph position="44"> Thus, the set of strings in NC not generated by a MCTAG can &quot;grow&quot; by a constant of 1. Hence, by the Joshi-Berwick definition of CG, NC is CG.</Paragraph> <Paragraph position="45"> However, according to Alexis Manaster-Ramer (p.c.), the Joshi-Berwick definition of Constant Growth is flawed. While the Joshi-Berwick definition excludes languages such as {a n I n is prime} from being CG, it nonetheless includes the language {b*a n I n is prime}, since strings in the latter can &quot;grow&quot; by any constant due to b*. Yet, these two languages are rather similar and one would not expect them to differ in terms of CG. To avoid this state of affairs, Manaster-Ramer proposes the following embellished definition of Constant Growth: A language L over {al,... ,an} is Constant Growth iff there is a set C of n-tuples of constants {(Cu,..., Cnl),..., (Clm ..... Cnm)} , and for any string in L the number of occurrences of the different letters can be increased by the constants of such an n-tuple, respectively. Moreover, for every Constant Growth language there is a constant k such that any string containing at least k occurrences of a letter ai can be increased by at least one such n-tuple which has a non-zero value for the corresponding ci.</Paragraph> <Paragraph position="46"> Thus, if L is infinite and there is, in particular, no upper bound on the number of ais in its sentences, then there is a way of obtaining longer strings by means of increasing the number of occurrences thereof (possibly, but not necessarily, in conjunction with that of some other letter or letters). According to this definition, {b*a n I n is prime} is not CG, since given a string with enough as in it, it no longer suffices to increase the number of bs, as is possible under the Joshi-Berwick definition. Likewise, under Manaster-Ramer's definition, Numeric Chinese is not CG: Let c be the greatest of the constants by which the number of zhaos may be increased. Let p = max(k + 1, c + 1).</Paragraph> <Paragraph position="47"> Now consider the following string z in NC: z = wu zhaoPwu zhao p-1 ... wu zhao By the embellished definition, if NC is CG then it must have some string with a greater number of wus and of zhaos than in z. Now in order to increase the number of wus, one must at least create a new column of zhaos whose length must be at least p + 1. But that would mean an increase by more than c, which is the greatest of the constants.</Paragraph> <Paragraph position="48"> There is, thus, no way of increasing the number of zhaos while both remaining in NC and satisfying Manaster-Ramer's definition of CG. Hence, NC is not CG under the embellished definition, but is under the Joshi-Berwick definition.</Paragraph> <Paragraph position="49"> However, as pointed out by David Weir (p.c.), Manaster-Ramer's definition is also flawed. While it excludes both {a n I n is prime} and {b*a n I n is prime} from being Constant Growth, it nonetheless includes {a*b*a n I n is prime}, since in this last language, both the as as well as the bs, i.e. all the members of its alphabet, can &quot;grow&quot; by any constant due to a'b*. But here again, the three of these languages are rather similar and one would not expect them to differ in terms of CG. All this suggests that perhaps these types of definitions will not lead us to capture the intuitive notion of CG in its entirety. While Manaster-Ramer's definition attempts to be as analogous as possible to some definitions of semilinearity, it still does not model all and only that handles some particular dependency. Moreover, what is considered 'limited' may be open to more than one interpretation. Joshi et al. (1989, p. 3) suggest that the cross-serial dependencies in Dutch subordinate clauses, homomorphic to the single-copy language {WW I W E (a + b)* }, could be considered an instance of a limited cross-serial dependency, but not the language MIX, which consists of an equal number of as, bs, and cs in any linear order. The dependencies in NC are of multiple columns correlated in length. These are considered Mildly CS, as MCTAGs capture them. What MCTAGs fail to do is generate a language with such dependencies, such as NC, that also places no fixed limits on the number of columns in its strings. We ask then whether a dependency that leads to an indefinite number of columns correlated in length is to be considered &quot;limited&quot; or not. Since, as mentioned earlier, what is &quot;limited&quot; may be open to numerous interpretations, we leave this question unanswered for the time being.</Paragraph> <Paragraph position="50"> Thus, a grammar for NC will have the Mildly CS property of limited dependencies, in case a dependency between an indefinite number of columns correlated in length is to be considered limited.</Paragraph> <Paragraph position="51"> We conclude then that it is not entirely clear whether NC is Mildly CS, although intuitively it appears most likely not to be so. While NC is polynomially recognizable, we can say nothing about polynomial-time parsing for this language, as this would require reference to a particular grammar generating the language. It is constant-growth under one known definition, but not so under a somewhat better embellished one. Last, we cannot determine whether its dependencies of indefinite columns of correlated length are limited or not, lacking a straightforward and lucid definition for &quot;limited.&quot; However, in contrast to what we can conclude regarding NC and the vague notion of mild context-sensitivity, we do at least know for certain that NC is not a MCTAL.</Paragraph> </Section> </Section> <Section position="8" start_page="289" end_page="295" type="metho"> <SectionTitle> 6. The Linguistic Relevance of Our Formal Results </SectionTitle> <Paragraph position="0"> In the previous sections, we have shown that Numeric Chinese can be generated neither by a single- nor a multiple-component TAG. What are the implications of this for its proper superset Chinese, a natural language? These are not immediately obvious. After discussing Zwicky's (1963) argument, which, albeit weaker, resembles our own due to the similarity between his language P and our language K, in a study which surveyed and refuted earlier arguments calling for the non-context-freeness of NL, Pullum and Gazdar (1982, p. 502) mention: The interest of this argument \[i.e., Zwicky (1963)\] in the context of the study of natural languages is, however, greatly lessened by the fact that it deals with the internal structure of elements of a representational system for mathematics. We would maintain that knowledge of how to construct such number names (which, of course, has to be explicitly taught to children who speak English perfectly well) is knowledge of mathematics rather than of language.</Paragraph> <Paragraph position="1"> However, simply &quot;maintaining,&quot; with no further argumentation, that we are dealing here with &quot;knowledge of mathematics rather than of language,&quot; is far from clear. More evidence is needed in order to sustain this claim. We intend to show below that, in any case, maintaining this position resolves little.</Paragraph> <Paragraph position="2"> 'structures' (and not necessarily 'strings') and, therefore, any redefinition of this property would have to reflect the structure-based intuition.</Paragraph> <Paragraph position="3"> Computational Linguistics Volume 17, Number 3 Our formal argumentation, as well as that of Zwicky's, relies crucially on characteristics (i) and (ii) discussed in Section 2. These characteristics have been shown in Section 3 to hold for the Chinese number-name system. The issue now is whether they hold because of some linguistic constraint or because of some other constraint, perhaps mathematical in nature.</Paragraph> <Paragraph position="4"> Let us assume, for the sake of argument, that these characteristics, particularly (ii), are mathematical in nature. In other words, the reason that larger clusters of zhao precede smaller ones is due to some constraint within the mathematical component of the human cognitive faculty. This would make little sense, however, since addition is commutative. In other words, in purely arithmetical terms wu zhao zhao wu zhao is equivalent in value to wu zhao wu zhao zhao. But while the former is empirically acceptable in the language, the latter is not. Thus, the linear order constraint that larger clusters of zhao must precede smaller ones cannot be based on pure mathematics.</Paragraph> <Paragraph position="5"> If the nature of the constraint is mathematical in any way, it must be one that relates to the interface between the cognitive linguistic component and the cognitive mathematical one. 16 Since, for purposes of cognitive numerical computing, number-names must ultimately be translated somehow into some encoding of their particular arithmetic values, an NL-math interface is necessary. Let us assume that the linear order constraint applies in the interface. Two possibilities then exist: the constraint is either string-based or value-based. We explore these below.</Paragraph> <Paragraph position="6"> String-based. What we mean by this is the following: The syntax of some NL, in our case Chinese, generates cardinal number-names. These are passed to the cognitive mathematical component for further computation, via the NL-math interface.</Paragraph> <Paragraph position="7"> In this interface, a test is performed on strings of type (i) to see that they adhere to characteristic (ii). Yet, given our result from Section 5, an automaton with the power of, or weaker than, a MCTAG will not be able to perform such a test, since it cannot discriminate between these strings and those of type (i) that do not adhere to characteristic (ii) (hence cannot accept J). Our results, then, are directly translatable to apply not to the &quot;pure&quot; syntax of Chinese, but rather to the interface between Chinese and mathematics. We will have said then something substantially interesting about the interface between NL and mathematics, namely, that this interface must have tools that are weakly more powerful than a MCTAG.</Paragraph> <Paragraph position="8"> Value-based. Here again, the syntax generates cardinal number-names that are then passed on to the cognitive math component via the NL-math interface. However, in this case the interface performs a test not on the basis of substring length, but rather on the basis of the numeric content of each column. In other words, when encountered with a strung-together number-name, the interface calculates, say right-to-left, the numeric value of each column and compares this value with the numeric value of its adjacent left column. The numeric value of columns must increase from right to left.</Paragraph> <Paragraph position="9"> However, in order to calculate the numeric value of a column, the interface must have the capacity to compute expressions such as a trillion raised to the power of n and save the result for further comparison. If a machine performs this task, as most Turing-like machines do, by means of strings whose length is in direct proportion to the numeric value they encode, then this case is essentially equivalent to a string-based test, as discussed earlier. Yet, if the length of the encodings is not proportional to their value, 16 In this paper, we do not use &quot;interface&quot; to refer necessarily to some 'psychologically (or neurologically) real' processing model, but rather to a model emulating cognitive competence (in the same way that the syntax-semantics interface and syntax-phonology interface are understood).</Paragraph> <Section position="1" start_page="292" end_page="294" type="sub_section"> <SectionTitle> Radzinski Chinese Number-Names </SectionTitle> <Paragraph position="0"> then we cannot conclude that a machine with more power than a MCTAG must be necessary for the task. Perhaps transducers weaker than, or incomparable with, MCTAGs would do. Yet, are we to expect the interface to be capable of doing computations like these? If it can compute xY then it can, most likely, deal with many sophisticated computations. But if so, then how does it differ in any substantial way from the &quot;pure&quot; cognitive mathematical component (assumed to be Turing-equivalent)? In other words, why would the interface need the power to perform multiple multiplications if the math component does this anyway? We would expect the interface to be far weaker, if possible. It is indeed possible for it to be weaker, if we assume the test is string-based.</Paragraph> <Paragraph position="1"> But then, our result regarding non-MCTALs holds again.</Paragraph> <Paragraph position="2"> Thus, if the linear order constraint applies in the NL-math interface, then this interface will either have to be more powerful than a MCTAG in terms of string recognition, or powerful enough to perform calculations that are complex in terms of computing capacity. Given the lack of parsimony in having an NL-math interface that can perform the same tasks that the cognitive mathematical component performs, we conclude that, if the test applies in the interface, this test must be string-based, forcing the interface to be more powerful than, or at least incomparable with, a MCTAG.</Paragraph> <Paragraph position="3"> Another possibility exists: that the constraint is purely syntactic in nature. If so, we encounter the problem created by what Manaster-Ramer (1988, p. 102) refers to as Ziff's Law: while, for instance, wu zhao wu zhao zhao is not a well-formed numbername, nothing blocks it from being some other well-formed proper name, such as the title of a book, for example. Thus the grammar of Chinese can generate a string such as wu zhao wu zhao zhao, albeit not as a number-name but still as some acceptable proper name that can function as a noun-phrase in a sentence. If so, the intersection of R or R' and Chinese is not H or J, respectively, but rather R or R', respectively. Since L and K are not homomorphisms of R and R', respectively, an intersection-based proof would fail to say much about the weak generative capacity of the whole of Chinese.</Paragraph> <Paragraph position="4"> That is, in the weak sense, we have not shown that Chinese is beyond the generative power of a TAG or a MCTAG, although we have shown this to be true for its proper subset Numeric Chinese.</Paragraph> <Paragraph position="5"> The consequences of Ziff's Law are avoided, however, by invoking considerations of classificatory capacity. This notion was introduced in Manaster-Ramer (1987a, p. 238) and preliminarily defined there as &quot;the measure of a formalism's ability \[to\] classify a set of strings (and substrings) and specify which ones are like which other ones.&quot; In Radzinski (1990a, p. 122 and 1990b, pp. 85-86), we have further refined this definition to apply to CFGs: Let some nonterminal in the grammar ultimately be rewritten only as a string belonging to some particular construction. For example, if passive sentences are to be considered a construction in some NL PS, then let the CFG G(PS) include a member PASS in its set of nonterminal symbols which yields in one or more steps all and only passive sentences of PS. We say then that G(PS) classifies passives.</Paragraph> <Paragraph position="6"> According to the definitions of Linear Context-Free Rewriting Systems given in Weir (1988), (including CFGs, Tree Adjoining Grammars, Head Grammars, Linear Indexed Grammars, Combinatory Categorial Grammars, and Multiple Component TAGs) any LCFRS contains a set of nonterminals. We can thus extend our definition to apply to all LCFRSs, in addition to CFGs. 17 What we would wish to test then is whether any 17 Also, as mentioned in Radzinski (1990a, p. 122 and 1990b, p. 94), there no doubt exist other means for defining classification within CFGs. This point applies to all other LCFRSs as well. Rather than using a Computational Linguistics Volume 17, Number 3 LCFRS for Chinese can classify NC, whose strings clearly constitute a construction in Chinese separate from other noun-phrases. Such a test would fail, since, given our formal results, an arbitrary MCTAG, or other LCFRS, G(Chinese) could not include a nonterminal yielding all Chinese number-names in one or more steps. Hence, if the linear order constraint is purely syntactic in nature, then Chinese is a non-MCTAL on grounds of classificatory capacity.</Paragraph> <Paragraph position="7"> No matter which position one takes with respect to where the linear order constraint applies, interesting conclusions follow: either Chinese is not a Multiple Component Tree Adjoining Language by classificatory capacity considerations or the interfacelanguage between Chinese and cognitive math, something we know little of, is not an MCTAL when viewed as a formal language. In his &quot;Topic ... Comment&quot; column, Pullum (1986, p. 410) once wondered: ... neither the mathematics nor the facts \[discussed in Zwicky (1963)\] are in dispute, yet the issue still seems hard to resolve .... We have to be taught \[the constraint\] in math classes at school, and we do not acquire it with our language &quot; per se. On odd dates I still think this is right, but on even dates I think the argument has been unjustly overlooked. Zwicky thinks we \[i.e. Pullum and Gazdar\] were correct to dismiss it, but maybe he is wrong and it was the first valid argument that English is non-CF. The problem here is that we are not entirely sure what is a fact about a language and what is a fact about the culture associated with it.</Paragraph> <Paragraph position="8"> Yet it is hard to believe that the constraint is explicitly taught in math classes at school, since children rarely deal with numbers with such values. Also, there is no reason to assume that wan wan is taught in school, given its natural and common use by Chinese native speakers, regardless of their educational level. TM Moreover, the phenomenon is completely irrelevant to the culture associated with Chinese or with any other natural language exhibiting a similar behavior. It is a constraint related either to the &quot;pure&quot; syntax of Chinese or to the component interfacing between this syntax and the cognitive faculty for arithmetic reasoning. Both of these are hardly culture-based. 19 We end this section by presenting, for purposes of contrast, two mathematics-based formal arguments that are quite irrelevant to the study of natural language, although their proponents may have laid claims, or at least hinted, otherwise. The first of these is an argument against the context-freeness of English presented in Elster (1978, pp. 42-44) and refuted by Pullum and Gazdar (1982, pp. 479-481). Elster bases single nonterminal as a basis, classification can be achieved via a set of nonterminals or via some production rule that applies in all and only the derivations of a particular construction. These definitions, however, would be merely similar alternative formalizations of the same intuitive notion. 18 An anonymous referee has indicated that while this may be true for wan wan, forms such as zhao are surely taught (originally) in school. Yet, we believe that even if the single-word number-name zhao is learned explicitly only via formal education, its use within a strung-together number-name and the linear order pattern exhibited by such a long number-name is not. Rather, the constraints for forming strung-together number-names using zhao are the same as those for forming long number-names using wan. As mentioned in Section 3, the use of zhao instead of wan in long number-names results in better acceptability, as the former appears to be the highest valued number denoted by a single-word in modern Chinese.</Paragraph> <Paragraph position="9"> 19 It may very well be the case that the constraint could be shown to be a consequence of some other independent cognitive strategy, such as a version of Hurford's (1975) Packing Strategy, for example. This strategy for number-names suggests roughly that arithmetically higher-valued chunks belong higher up in the tree than do the lower-valued ones, analogous to the way one packs a suitcase of books: first the big fat heavy ones and then the lighter ones. However, formal claims (with mathematical rigor) are made precisely in order to get a better understanding and give a better description of what strategies are meant to account for informally.</Paragraph> </Section> <Section position="2" start_page="294" end_page="295" type="sub_section"> <SectionTitle> Radzinski Chinese Number-Names </SectionTitle> <Paragraph position="0"> his argument on sentences like the following: BI: The first two million numbers in the decimal expansion of 7r are ala2.., a2000000. B2: The first two million million numbers in the decimal expansion of 7r are ala2...a2000000000000.2o Bk: The first two (million) k numbers in the decimal expansion of 7r are ala2.., a2.106k. According to the pumping lemma for CFLs, any sufficiently long sentence of a CFL can be extended by indefinite repetition of, at most, two subparts without violation of grammaticality. The only possible pumpable substrings in sentences like Bk are within the cluster of millions and al ...an. If we pump the substrings 'millionq' and &quot;ar...at&quot; up once, then we end up with a sentence like C: C: The \[first\] two millionk+q.., numbers in the decimal expansion of p are 111 .. o atar . . . at... a2.106k Elster claims (p. 44) that C is not a grammatical sentence in English because: ... the number 'two million k+q' must be the same as the number '2.10 6k q- t -- r + 1', i.e. the same as the number of numbers in the decimal expansion. Note that this is a requirement not of mathematics, but of linguistics, just as the lack of grammaticality of the sentence, D: the two largest animals in the zoo are a mouse, is a matter of linguistics, and not of mathematics ...</Paragraph> <Paragraph position="1"> But, according to Elster, in order for C to be &quot;grammatical,&quot; the length of its decimal expansion must be longer: 2.106(k+q) . Thus, since there is a sufficiently long sentence in English that lacks, at most, two pumpable substrings, English is not CE As Pullum and Gazdar (1982, pp. 480-481) claim, Elster is wrong since he is assuming that English grammar requires, as seen from the ungrammaticality of D versus the grammaticality of the B sentences above, that the number of entities listed in a predicate correspond to the number named in the subject. Yet if D is ungrammatical, it is not because of such a requirement, but rather because of the number (i.e., singular/plural) disagreement it exhibits. If Elster were right, then the sentence &quot;The two largest animals in the zoo are Mickey, Minnie, and Donald&quot; would be ungrammatical. Yet this sentence is clearly grammatical, albeit infelicitous. Elster is confusing 20 The reproduction of this sentence in Pullum and Gazdar (1982, p. 479) lacks one instance of million. Thus, Elster's argument becomes largely incomprehensible if read only there. The version reproduced in Savitch et al. (1987, p. 149) contains the same typographical error. Also, notice that Elster's acceptance of sentences of this sort with strung-together millions, indicates that he too is of the opinion that strung-together number-names are empirically attested in English.</Paragraph> <Paragraph position="2"> Computational Linguistics Volume 17, Number 3 grammaticality with arithmetical felicity. His argument, thus, bears little on natural language.</Paragraph> <Paragraph position="3"> The second argument, this time not discussed by Pullum and Gazdar (1982), is one presented in Brainerd (1966b, pp. 119-124). Brainerd discusses &quot;verbal expressions,&quot; i.e. number-names, for rational numbers of the following type: zero point three six five four six five three six five three etc.</Paragraph> <Paragraph position="4"> three point two seven seven two seven seven two seven seven etc.</Paragraph> <Paragraph position="5"> These number-names for repeated decimals contain subparts of indefinite length that are repeated ad infinitum. However, given the periodicity of the repetition, the number, and hence its name, may be expressed by a convention requiring only a fixed amount of repetitions. A convention using merely one repetition suffices to argue that the set comprising the names for repeated decimals constitutes a non-CFL (two repetitions: a non-TAL). As Brainerd claims on p. 122: ... it is easy to show that the language L(R) \[comprising the numeral names for non-negative rational numbers in English\] possesses no context-free grammar.</Paragraph> <Paragraph position="6"> Indeed, such a language can be shown to be a non-CFL, via the strong pumping lemma for Context-Free Languages. Brainerd later writes on p. 124: It is perhaps not out of place to observe that if repeated decimals (in their English verbal form) are a part of the natural language, then repeated decimals constitute an example of a duplication-structure of arbitrary length in English.</Paragraph> <Paragraph position="7"> Thus, Brainerd suggests, albeit equivocally, that English, a natural language, is affected by the non-context-freeness of its sub-language consisting of the names for rational numbers. Yet, this could hardly have any basis whatsoever. On grounds of weak generative capacity, claiming that English is non-CF because it has &quot;a duplication-structure of arbitrary length,&quot; is as absurd as claiming that the regular language {a*b*c*} is a non-CFL because it is a proper superset of the non-CFL {anbncn}. Although pumping a string in L(R), longer than some constant, in accordance with the conditions set forth by the strong pumping lemma for CFLs, yields a string outside of L(R), it nevertheless yields one that is a well-formed number-name, albeit corresponding perhaps to an irrational number. However, considerations of classificatory capacity will not help here, since there is absolutely no linguistically based reason to assume that the set of names for rational numbers constitutes a construction distinct from names for other types of numbers. Thus, as with Elster's argument, Brainerd's has little bearing on natural language. Contrary to these two arguments, our claims, as we have already seen and discussed, definitely do bear either on the syntax of a natural language or on the interface between that natural language and the cognitive mathematical faculty.</Paragraph> </Section> </Section> class="xml-element"></Paper>