<?xml version="1.0" standalone="yes"?> <Paper uid="C82-1024"> <Title>REFERENCES</Title> <Section position="2" start_page="0" end_page="151" type="metho"> <SectionTitle> CONSTRAINTS ON INCREMENTAL SYNTACTIC PROCESSORS </SectionTitle> <Paragraph position="0"> 1. Lexically driven syntactic processing. The first constraint derives from the fact that it is conceptual structures which serve as input to the tree formation process. A good strategy for translating these meanings into language begins by looking up words covering them (&quot;lexicalization&quot;). Subsequently, the processor attempts to build a syntactic framework which accommodates all words while respecting their syntactic properties (e.g. word class). In case of success, the result is a syntactic tree with lexical items as terminal nodes. In case of failure, one or more words are replaced by other lexical material which expresses the same meaning but whose syntactic properties are more favorable. The point we want to make here is that it is the syntactic properties and peculiarities of lexical items which guide the tree formation process.</Paragraph> <Paragraph position="1"> In short, syntactic processing is lexically driven. This feature requires special rules not found in current linguistic grammars, where it is common practice to set up a linguistic framework (e.g., by applying phrase-structure rules) without reference to syntactic properties of lexical items \[1\]. Adopting this practice would presuppose that syntactic trees are directly computable from the shape of conceptual structures, that is, without the intermediation of lexical items. This supposition is valid only for conceptual structures which are virtually isomorphic with syntactic trees. Most probably, such an isomorphism does not hold for the structures delivered by the conceptualization system in human speakers.</Paragraph>
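To make the control flow of this cycle concrete, here is a minimal sketch in present-day Python (all names and data structures are our illustration, not the implementation discussed in this paper): lexicalization comes first, then an attempt to build a framework that respects the words' syntactic properties (here reduced to word class), with fallback to an alternative wording on failure.

# Illustrative sketch of lexically driven formulation (hypothetical names).
LEXICON = {
    "POSSESS": [
        [("possession", "N")],   # nominal wording
        [("own", "V")],          # verbal wording
    ],
}

def build_framework(words, frames):
    """Succeed iff the words' word classes fit one of the syntactic frames."""
    classes = tuple(cls for _, cls in words)
    return classes if classes in frames else None

def formulate(meaning, frames):
    for words in LEXICON[meaning]:      # lexicalization first ...
        if build_framework(words, frames) is not None:
            return words                # ... lexical items become terminal nodes
    return None                         # would trigger further re-lexicalization

# A frame requiring a verb rejects the nominal wording and falls back to "own":
print(formulate("POSSESS", frames={("V",)}))   # [('own', 'V')]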
<Paragraph position="2"> 2. Hierarchy and order of constituents computed by different components. The second constraint hinges upon the independence between the order of conceptual fragments coming in and the order of the corresponding syntactic fragments. With the possible exception of languages with extremely flexible word order, grammar rules do not always permit a new syntactic fragment to be simply appended to the right-hand side of the current tree. Other spatial arrangements of the new fragment with respect to the current syntactic tree are possible, depending on the word order rules in the grammar. Sometimes these rules even ask for the presence of other elements between the current tree and a newly computed syntactic fragment. A clear example is provided by the position of verbs in main clauses of Dutch and German. Subject noun phrases and adverbial phrases cannot follow each other at the beginning of a main clause. The finite main verb or auxiliary is always in between: either NP-V-AP or AP-V-NP but not NP-AP-V or AP-NP-V. Grammars which use some version of the traditional phrase-structure rules do not keep word order apart from phrase membership (more precisely, constituent hierarchy from constituent order). For example, consider the following rules which express the above word order contingencies: S --> NP+V+AP and S --> AP+V+NP. Now suppose that the syntactic processor is working on a conceptual fragment which lexicalizes into a verb, and applies the first rule which says, among other things, that the verb needs an NP at its left-hand side. In the meantime a new conceptual fragment has come in which receives the syntactic shape of an AP. The first rule does have an AP slot, but not to the left of the verb. This implies the syntactic processor has to wait for a third conceptual fragment which can be worded in the form of an NP. At that point the syntactic processor can deliver its first output: an NP-V-AP utterance. The waiting time, that is, the period between onset of (conceptual) input and onset of (syntactic) output, would have been shorter, had the syntactic processor picked the second phrase-structure rule. Then, output could already have begun after the second conceptual fragment (&quot;AP-V...&quot;) and closed off grammatically with &quot;...NP&quot;. Because the order of conceptual fragments is unknown in advance, the syntactic processor can never be sure of having made the best choice between rules. This problem does not arise in a rule system which allows word order to be computed independently of phrase membership. We conclude, therefore, that in an incremental syntactic processor it is desirable to have separate components for tree (or rather &quot;mobile&quot;) formation and for word order.</Paragraph> <Paragraph position="3"> 3. Explicit computation of grammatical (functional) relationships. Traditional phrase-structure rules allow grammatical relationships (subject, direct object, nominal modifier, etc.) to be inferred from configurations of categorial nodes in the syntactic tree. This is not true of tree formation rules which leave left-to-right order of constituents undefined. If such a system contained a rule VP --> V-NP-NP it would be impossible to determine which of the NPs served the function of direct object. Functional information of this kind is needed by the word order component (or, in languages with free word order, by the morphological case component).</Paragraph> </Section> <Section position="3" start_page="151" end_page="151" type="metho"> <SectionTitle> INCREMENTAL SENTENCE GENERATION </SectionTitle> <Paragraph position="0"> An additional motivation for direct computation of functional syntactic relationships is provided by the lexicalization process. Constraint #1 prescribes that it is the choice of lexical items which guides the formation of syntactic trees rather than vice versa.</Paragraph> <Paragraph position="1"> Many lexical items require a specific syntactic environment, a typical example being verbs like give which cause the formation of a VP with two NPs, for direct and indirect object respectively. In conjunction with customary phrase-structure rules, this property of give could be expressed in terms of a desired configuration of categorial nodes, e.g. (VP(NP )(NP )). This option is not available in a tree formation system which generates &quot;mobiles&quot;. Here, the lexical entry for give should explicitly reference direct and indirect objects as desired constituents.</Paragraph>
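Constraints #2 and #3 can be illustrated together with a small sketch (hypothetical Python representation, not Procedural Grammar itself): the "mobile" records only dominance plus explicit function labels, a lexical entry such as give names its desired constituents by function rather than by a categorial configuration, and a separate word order step linearizes the mobile, here applying the Dutch/German verb-second rule.

# Illustrative sketch: hierarchy and functions kept apart from word order.
mobile = {"cat": "S", "constituents": [
    ("head", {"cat": "V", "word": "geeft"}),
    ("subject", {"cat": "NP", "word": "Jan"}),
    ("direct object", {"cat": "NP", "word": "het boek"}),
]}

# A lexical entry references desired constituents by function, not category:
GIVE = {"cat": "V", "desired": ["subject", "direct object", "indirect object"]}

def linearize(clause, first):
    """Word order as a separate component: one constituent first, the finite
    verb (the 'head') second, whatever order the branches were built in."""
    items = dict(clause["constituents"])
    rest = [f for f, _ in clause["constituents"] if f not in ("head", first)]
    return " ".join(items[f]["word"] for f in [first, "head"] + rest)

print(linearize(mobile, "subject"))        # Jan geeft het boek
print(linearize(mobile, "direct object"))  # het boek geeft Jan

Because linearization is deferred, the processor never has to gamble on an NP-V-AP versus AP-V-NP rule before knowing the order of the incoming conceptual fragments.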
<Paragraph position="2"> 4. Simultaneous construction of parallel branches of syntactic trees. Constraint #2 entails a work scheduling problem: if more than one branch is to descend from a given node, in what order should they be constructed by the syntactic processor? The standard solution, i.e. to develop constituents in their order of appearance in surface structure, is no longer applicable since left-to-right order is undefined at this stage \[2\]. In a lexically driven syntactic processor, the most efficient solution is a priority scheme based on the order of arrival of lexical items (cf. Constraint #1). This order, in turn, is the combined result of the order in which conceptual fragments become available and the manner of operation of the lexicalization process, and need not correspond at all to their surface structure order. For example, the verb-second rule of German and Dutch applies irrespective of whether the verb comes into play earlier or later than other lexical materials of the main clause.</Paragraph> <Paragraph position="3"> When computing a branch connecting a lexical item to the current syntactic tree, the processor has to take into account the functional relations this item maintains with other lexical items put forward by the lexicalization process (cf. Constraint #3). For example, the noun designated as subject of a verb will receive a different place in the syntactic tree than the object noun. For the rest, there are no cross-branch computational dependencies forcing a systematic order upon the construction of branches of syntactic trees. This statement is supported by the success most grammar types obtain by having context-free rules generate deep structure trees (and sometimes even surface structure trees). This implies we can trust the above priority scheme (simply follow order of arrival of lexical items) even though it does not bring any syntactic expertise to bear. Now suppose the syntactic processor is capable of a certain amount of parallel processing. We are then permitted to assume that lexical items are attached to the tree simultaneously (again respecting their order of arrival) rather than sequentially. As a matter of fact, this is what human speakers seem to do, as witnessed by certain speech error phenomena (Garrett, 1975) and by reaction times to initiate sentences (Kempen & Huijbers, in press).</Paragraph>
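A sketch of the arrival-order priority scheme (again illustrative Python with hypothetical names): each lexical item is attached under its functional relation the moment it arrives, and since there are no cross-branch dependencies, the attachments are independent and could in principle run in parallel.

# Illustrative sketch: branches built in order of arrival, not surface order.
def attach(tree, item):
    """Attach an item under its functional relation as soon as it arrives;
    no left-to-right commitment is made at this point."""
    tree[item["function"]] = item["word"]

tree = {}
arrivals = [  # order of arrival from lexicalization, not surface order:
    {"word": "geeft", "function": "head"},            # verb may come first ...
    {"word": "het boek", "function": "direct object"},
    {"word": "Jan", "function": "subject"},           # ... or the subject last
]
for item in arrivals:   # independent steps; could be executed simultaneously
    attach(tree, item)
print(tree)  # hierarchy fixed by function labels; linearization comes later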
<Paragraph position="4"> 5. Operations on syntactic trees subject to locality constraints.</Paragraph> <Paragraph position="5"> In an incremental syntactic processor, the application of tree formation and word order rules will often yield &quot;narrow&quot; trees dominating small sentence fragments. Now suppose that some trees have to undergo certain obligatory operations (e.g., transformations to be executed or anaphoric relationships to be established) and that such operations are triggered as soon as the tree matches a specific pattern.</Paragraph> <Paragraph position="6"> One can imagine &quot;horizontal&quot; (left-to-right) trigger patterns spanning a number of parallel branches, &quot;vertical&quot; (top-down) patterns specifying some configuration of dominating and dominated nodes on one branch, or &quot;mixed&quot; patterns. The pattern that triggers passivization is an example of a horizontal one involving several parallel branches (object NP, passive marker, main verb, and optional subject NP). Wh-fronting and Raising transformations are triggered by vertical or nearly vertical patterns. Incremental production favors vertical trigger patterns because they are more easily satisfied by narrow (partial) syntactic trees corresponding to fragmentary conceptual inputs. Horizontal patterns can only be matched by &quot;wider&quot; syntactic trees which correspond to more elaborate conceptual structures. This latter requirement, however, runs counter to the very idea of incremental sentence production.</Paragraph> <Paragraph position="7"> The interesting point is that conditions on transformations and other linguistic rules can usually be expressed in terms of (nearly) vertical node configurations. A clear example is provided by Koster's (1978) Locality Principles, where a central role is played by dominance and command relationships between nodes. In the context of a computer implementation of our Procedural Grammar (Kempen & Hoenkamp, 1981), which was specially designed for the purpose of incremental production, we have attempted to make more precise the parallelism between vertical trigger patterns and locality constraints on transformations (see Hoenkamp, 1982, for a more formal approach).</Paragraph>
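The contrast between vertical and horizontal trigger patterns can be made concrete with a small matcher over dominance chains (an illustrative sketch, not the pattern language of Procedural Grammar; the tree encoding is hypothetical):

# Illustrative sketch: a vertical pattern is a top-down chain of categories
# that must stand in a (transitive) dominance relation on a single branch.
def matches_vertical(node, chain):
    """True iff the categories in chain occur top-down along some branch.
    chain must be non-empty on the initial call."""
    if node["cat"] == chain[0]:
        chain = chain[1:]        # this node satisfies the next pattern element
        if not chain:
            return True
    return any(matches_vertical(child, chain)
               for child in node.get("children", []))

# Even a narrow partial tree can satisfy a vertical pattern:
narrow = {"cat": "S'", "children": [
    {"cat": "COMP", "children": [{"cat": "WH", "children": []}]}]}
print(matches_vertical(narrow, ["S'", "COMP", "WH"]))  # True

Note that a horizontal pattern such as the passivization trigger cannot even be stated in this matcher, which inspects only one branch at a time; supporting it would require waiting for a wider tree.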
<Paragraph position="8"> 6. Lexical transformations. By constructing a partial syntactic tree the syntactic processor commits itself to a limited range of continuation alternatives. The lexicalization process should be sensitive to such syntactic commitments by making available lexical materials which are compatible with what has been said in earlier parts of the sentence. Take the example of a concept which, after having been expressed as a subject NP, turns out to be the patient of an action, as specified in a subsequent conceptual fragment. A typical lexical realization of the action might be an active verb which prescribes the patient to be rendered as the object NP. However, this would entail incompatibility with the current syntactic tree.</Paragraph> <Paragraph position="9"> The solution is provided by a lexical passive transformation that, among other things, alters the pairings of conceptual cases with syntactic functions (causing patient to go with subject). The transformed lexical item is then successfully attached to the current tree. By attuning lexical items to the exigencies of incomplete syntactic trees, the lexicalization component greatly enhances the left-to-right planning capabilities of the syntactic processor \[3\].</Paragraph> <Paragraph position="10"> 7. Sentence production in two stages. In the foregoing we have not yet touched upon the issue of where and when inflectional computations are carried out. The obvious placement of an inflectional component -- somewhere at the end of the tree formation process -- leads to an interesting problem. In many languages, including English, German and Dutch, clitics (monosyllabic function words) are optionally contracted with preceding lexical items. Some examples are John is ill --> John's ill; will not --> won't; (Ger.) unter dem Turm --> unterm Turm (&quot;under the tower&quot;); von dem --> vom (&quot;of the ...&quot;). Clitic contraction implies merging the lexical items of two adjacent branches of a syntactic tree into a single word.</Paragraph> <Paragraph position="11"> In the context of Constraint #4 we have seen that the most efficient order of constructing branches of a syntactic tree simply copies the order of arrival of lexical items. This, in turn, implies that clitic contraction cannot be performed by the tree formation components (including the word order component): there is always a chance that a later lexical item gets hold of a place in between the clitic and its predecessor. For instance, John be ill might be expanded into John will be ill, or von dem into von all dem (&quot;of all the ...&quot;).</Paragraph> </Section> <Section position="4" start_page="151" end_page="151" type="metho"> <SectionTitle> INCREMENTAL SENTENCE GENERATION </SectionTitle> <Paragraph position="0"> Therefore, clitic contraction must take place after tree formation, that is, after the moment the syntactic processor decides that the current tree (possibly an incomplete one) is a suitable linguistic formulation of the conceptual input received so far. It follows that there is a subsequent stage of processing which takes care of clitic contraction, and maybe of other aspects of the morphological shape of words.</Paragraph> <Paragraph position="1"> This latter addition is plausible from the point of view of efficiency. It does not make sense to have the tree formation components engage in detailed inflectional computations if some of these are undone later (namely, the computations that are superseded by clitic contraction).</Paragraph> <Paragraph position="2"> It is a remarkable fact that speech error data have given rise to a two-stage sentence production model with a similar division of labor between stages: roughly, syntactic tree formation versus inflectional morphology (Garrett, 1975; Kempen & Huijbers, in press). These data also suggest that the second processing stage deals with the terminal nodes of a (possibly incomplete) syntactic tree in their left-to-right order.</Paragraph> </Section>
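A sketch of such a second stage (illustrative Python; the pairwise contraction table is a toy stand-in for real morphological rules): the terminal nodes of a tree the processor has already released are scanned left to right, and adjacent items are merged where a contraction applies.

# Illustrative sketch of the second processing stage (hypothetical names).
CONTRACTIONS = {("John", "is"): "John's", ("will", "not"): "won't",
                ("unter", "dem"): "unterm", ("von", "dem"): "vom"}

def second_stage(terminals):
    """Left-to-right pass over the terminals of a released (partial) tree."""
    out = []
    for word in terminals:
        if out and (out[-1], word) in CONTRACTIONS:
            out[-1] = CONTRACTIONS[(out[-1], word)]  # merge adjacent branches' items
        else:
            out.append(word)
    return out

print(second_stage(["John", "is", "ill"]))    # ["John's", "ill"]
print(second_stage(["John", "will", "not"]))  # ["John", "won't"]

Because this pass runs only over the terminals of a tree the syntactic processor has already released, no later lexical item can still intervene between the clitic and its host.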
<Section position="5" start_page="151" end_page="151" type="metho"> <SectionTitle> INCREMENTAL SENTENCE PRODUCTION IN MODELS OF THE SPEAKER </SectionTitle> <Paragraph position="0"> It will come as no surprise to the reader that the only computational model of sentence production which, in the authors' opinion, satisfies all or most of the above constraints, is the one developed by the authors themselves (Kempen & Hoenkamp, 1981). We know of one other computational sentence generator whose design was explicitly concerned with incremental production. It was written by McDonald (1980, 1982) and embodies a broad range of syntactic constructions.</Paragraph> <Paragraph position="1"> However, this model fails to distinguish hierarchical from word order rules and, consequently, violates Constraint #3. We cannot judge whether removal of this shortcoming will necessitate drastic changes to the rest of the program.</Paragraph> <Paragraph position="2"> The type of grammar embodied by the Kempen & Hoenkamp model (Procedural Grammar) is similar to Lexical Functional Grammar (Kaplan & Bresnan, 1982; see also Bresnan, 1981). The main difference concerns the attitude towards transformations. In Lexical Functional Grammar, surface trees are base-generated and no transformational component is needed. If Kaplan & Bresnan motivate their rejection of a transformational component on psychological grounds \[4\], we disagree. Neither incremental production nor any other known fact about human sentence production processes argues for complete banishment of transformational operations on syntactic trees.</Paragraph> <Paragraph position="3"> Procedural Grammar is unique in its ability to deal effectively with conceptual inputs which may change on line. A conceptual structure which is altered after it has been expressed linguistically causes the processor to backtrack and to make &quot;repairs&quot;.</Paragraph> <Paragraph position="4"> Acknowledgements. The work reported in this paper was supported by a grant from ZWO, the Netherlands Organization for Pure Scientific Research. We are indebted to Patrick Hudson for his valuable comments.</Paragraph> </Section> <Section position="6" start_page="151" end_page="151" type="metho"> <SectionTitle> NOTES </SectionTitle> <Paragraph position="0"> \[1\] Categorial grammars form an exception here. However, a processor based on this grammar type violates Constraints #2 and #3: categorial rules presuppose left-to-right order of lexical items, and make no use of functional syntactic relations.</Paragraph> <Paragraph position="1"> \[2\] This solution is the one that has been adopted of old, from Yngve (1960), via ATN-based generators (e.g. Simmons & Slocum, 1972; Anderson, 1976) to McDonald (1980).</Paragraph> <Paragraph position="2"> \[3\] Lexical transformations may involve other types of alterations as well, e.g., derivational morphological operations and insertion of function words. Actually, the addition of function words and inflections (or, rather, inflectional prescriptions) is another general possibility for the lexicalization component to accommodate a lexical item to properties of the current syntactic tree. (Inflectional prescriptions are executed during a subsequent processing stage; see Constraint #7.)</Paragraph> <Paragraph position="3"> \[4\] Bock (1982, p. 28) opts for Gazdar's (1981) context-free grammars because they are &quot;much more compatible with on-line processing models than transformational grammars&quot;.</Paragraph> </Section> </Paper>