File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/04/p04-2009_intro.xml
Size: 3,013 bytes
Last Modified: 2025-10-06 14:02:23
<?xml version="1.0" standalone="yes"?> <Paper uid="P04-2009"> <Title>Robust VPE detection using Automatically Parsed Text</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> Ellipsis is a linguistic phenomenon that has received considerable attention, mostly focusing on its interpretation. Most work on ellipsis (Fiengo and May, 1994; Lappin, 1993; Dalrymple et al., 1991; Kehler, 1993; Shieber et al., 1996) is aimed at discerning the procedures and the level of language processing at which ellipsis resolution takes place, or ambiguous and difficult cases. The detection of elliptical sentences or the identification of the antecedent and elided clauses within them are usually not dealt with, but taken as given. Noisy or missing input, which is unavoidable in NLP applications, is not dealt with, and neither is focusing on specific domains or applications. It therefore becomes clear that a robust, trainable approach is needed.</Paragraph> <Paragraph position="1"> An example of Verb Phrase Ellipsis (VPE), which is detected by the presence of an auxiliary verb without a verb phrase, is seen in example 1.</Paragraph> <Paragraph position="2"> VPE can also occur with semi-auxiliaries, as in example 2.</Paragraph> <Paragraph position="3"> (1) John3 floves his3 wifeg2. Bill3 does1 too.</Paragraph> <Paragraph position="4"> (2) But although he was terse, he didn't frage at meg2 the way I expected him to1.</Paragraph> <Paragraph position="5"> Several steps of work need to be done for ellipsis resolution : 1. Detecting ellipsis occurrences. First, elided verbs need to be found.</Paragraph> <Paragraph position="6"> 2. Identifying antecedents. For most cases of ellipsis, copying of the antecedent clause is enough for resolution (Hardt, 1997).</Paragraph> <Paragraph position="7"> 3. Resolving ambiguities. For cases where ambiguity exists, a method for generating the full list of possible solutions, and suggesting the most likely one is needed.</Paragraph> <Paragraph position="8"> This paper describes the work done on the first stage, the detection of elliptical verbs. First, previous work done on tagged corpora will be summarised. Then, new work on parsed corpora will be presented, showing the gains possible through sentence-level features. Finally, experiments using unannotated data that is parsed using an automatic parser are presented, as our aim is to produce a stand-alone system.</Paragraph> <Paragraph position="9"> We have chosen to concentrate on VP ellipsis due to the fact that it is far more common than other forms of ellipsis, but pseudo-gapping, an example of which is seen in example 3, has also been included due to the similarity of its resolution to VPE (Lappin, 1996). Do so/it/that and so doing anaphora are not handled, as their resolution is different from that of VPE (Kehler and Ward, 1999). (3) John writes plays, and Bill does novels.</Paragraph> </Section> class="xml-element"></Paper>