File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/94/c94-1063_intro.xml
Size: 4,078 bytes
Last Modified: 2025-10-06 14:05:35
<?xml version="1.0" standalone="yes"?> <Paper uid="C94-1063"> <Title>PARSING A FLEXIBLE WORD ORDER LANGUAGE</Title> <Section position="3" start_page="0" end_page="0" type="intro"> <SectionTitle> 1. INTRODUCTION </SectionTitle> <Paragraph position="0"> Natural languages exhibit significant word order (WO) variation and intricate ordering rules. Despite the fact that specific languages show less varmtion and complexity in such rules (e.g. those characterized by either fixed, or totall 3, free, WO), the vast majority of world languages lie somewhere in-between these two extremes (e.g. Steele 1981). Importantly, even the proclaimed examples of rigid WO languages (English) exhibit variation, whereas those with proclaimed total scrambling (Warlpiri; cf. llale 198.1) show restrictions (Kashket 1987).</Paragraph> <Paragraph position="1"> Therefore, we need general grammar formalism, capable of processing &quot;flexible&quot; WO (i.e. complex WO regularities, including both extremes).</Paragraph> <Paragraph position="2"> There seem to be a number of requirements that such a formalism should (try to) fulfil (e.g. Pericliev and Grigorov 1992). Among these stand out the formalism's: (i) Expressive power, i.e. capability of (reasonably) handling complex WO phenomena, or &quot;flexible&quot; WO.</Paragraph> <Paragraph position="3"> (ii) Linguistic felicity, i.e. capability of stating concisely and declaratively WO roles in a way maximally approximating linguistic parlance in similar situations.</Paragraph> <Paragraph position="4"> (iii) Modularity, i.e. the separation of constituency rules from the rules pertaining to the linearization of these constituents (for there may be many, and diverse, reasons for wanting linearization (and constituency) rules easily modifiable, incl. the transparency of WO statements, the imprecision of our current knowledge of ordering rules or the wish to tailor a system to a domain with specific WO).</Paragraph> <Paragraph position="5"> (iv) Reversibility, i.e. the ability of a system to be used for both parsing and generation (the reason being that, even if the system is originally intended for a parser, complex WO rules may be conveniently tested in the generation mode; in this sense it is not incidental that e.g. Kay & Kartttmen 1984 have first constructed a generator, and used it as a tool in testing the (WO) rules of their grammar, and only then have converted it into a parser).</Paragraph> <Paragraph position="6"> In this paper, we present a logic-based formalism which attempts to satisfy the above requirements. A review shows that most previous approaches to WO within the logic grammars paradigm (Dahl & Abramson 1990) have not been satisfactory. Definite Clause Grammar, DCG, (Pereira & Warren 1980), with their CF-style rules, are not modular (in the sense above), so will have to specify explicitly each ordering of constituents in a separate rule, which results in an intolerably great number of rules in parsing a free WO language (e.g. for 5 constituents, which may freely permute, the number of rules is 5! = 120). Other approaches center around the notion of a &quot;gap&quot; (or &quot;skip&quot;). In Gapping Grammar (GG), for instance (Dahl & Abramson 1984, esp. Dahl 1984), where a rule with a gap may be viewed as a recta-rule, standing for a set of CF rules, free WO is more economically expressed, however, due the unnaturahmss of expressing permutations by gaps, GGs generally are clumsy for expressing tlexible WO, WO is not declaratively and modularly expressed, and GGs cannot be used for generation (being besides not efficiently implementable). Another powerful formalism, Contextual Discontinuous Gr,'unmar (Saintl)izier 1988), which overcomes the GGs problems with generative capacity, is also far from being transparent and declarative in expressing WO (e.g. rules with fixed WO arc transformed into free order ones by intruducing special rules, containing symbols with no linguistic motivation, etc.).</Paragraph> </Section> class="xml-element"></Paper>