File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/79/p79-1006_intro.xml
Size: 3,384 bytes
Last Modified: 2025-10-06 14:04:16
<?xml version="1.0" standalone="yes"?> <Paper uid="P79-1006"> <Title>UNGRAMHATICALITY AND EXTRA-GRAMMATICALITY IN NATURAL LANGUAGE UNDERSTANDING SYSTEMS</Title> <Section position="3" start_page="0" end_page="19" type="intro"> <SectionTitle> II.I Co-Occurrence Violations </SectionTitle> <Paragraph position="0"> Our first class of errors can be connected to co-occurrence restrictions within a sentence. There are many occassions in a sentence where two parts or more must agree (= indicates an ill-formed or ungrammatical sentence): =Draw a circles.</Paragraph> <Paragraph position="1"> &quot;I will stay from now under midnight.</Paragraph> <Paragraph position="2"> The errors in the above involve coordination between the underlined words. The first example illustrates simple agreement problems. The second involves a complicated relation between at least the three underlined terms. Such phenomena do occur naturally. For example, Shore ($H077\] analyzes fifty-six freshman English papers written by Black college students and reveals patterns of nonstandard usage ranging from uninflected plurals, possessives, and third person singulars to overinflection (use of inappropriate endings.) For co-occurrence violations, the blocks that keep inputs from being parsed as the user intended arise from a failure of a test on an arc or the failure to satisfy an arc type restriction, e.g., failure of a word to be in the correct category. The essential block in the first example would likely occur on an agreement test on an arc accepting a noun, The essential blockage in the second example is likely to come from failure of the arc testing the final preposition.</Paragraph> <Paragraph position="3"> In handling ellipsis, the most relevant distinction to make is between contextual and telegraphic ellipsis. Contextual ellipsis occurs when a form only makes proper sense in the context of other sentences. For example, the form ePresident Carter has.</Paragraph> <Paragraph position="4"> seems ungrammatical without the preceding question form Who has a daughter named Amy? President Carter has.</Paragraph> <Paragraph position="5"> Telegraphic ellipsis, on the other hand, occurs when a form only makes proper sense in a particular situation. For example, the tome 3 chairs no waiting (sign in barber shop) Yanks split (headline in sports section) Profit margins for each product (query submitted to a NLU system) are cases of telegraphic ellipsis with the situation noted In parentheses. The final example Is from an experimental study of NLU for management information which indicated that such forms must be considered \[MAL75\].</Paragraph> <Paragraph position="6"> Another type of unarammaticality related to ellipsis occurs when the user puts unnecessary words or phrases In an utterance. The reason for an extra word may be a change of intention In the middle of an utterance, an oversight, or simply for emphasis. For example, * Draw a llne with from here to there.</Paragraph> <Paragraph position="7"> &quot;List prices of single unit prices for 72 and 73. The second example comes from Malhotra \[MALT5\]. The best way to see the errors In terms of the ATN is to think of the user as trylng to complete a path through the grammar, but having produced an input that has too many or too few forms necessary to traverse all arcs,</Paragraph> </Section> class="xml-element"></Paper>