File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/97/a97-2014_metho.xml
Size: 4,208 bytes
Last Modified: 2025-10-06 14:14:34
<?xml version="1.0" standalone="yes"?> <Paper uid="A97-2014"> <Title>An English Grammar Checker as a Writing Aid for Students of English as a Second Language</Title> <Section position="4" start_page="0" end_page="0" type="metho"> <SectionTitle> 2 Grammatical Mistakes </SectionTitle> <Paragraph position="0"> The current project started as part of a collaboration between the Computer and Information Science department and the English Language Programs at the University of Pennsylvania in an effort to provide a computational tool for students who are learning English as a second language. As an initial attempt, the present implementation focuses on certain dominant types of syntactic mistakes, as identified from the available set of students' essays.</Paragraph> <Paragraph position="1"> The kind of mistakes that are detected by the current system includes: Wrong capitalization (sentence initial, wrong lowercase/uppercase initial letter), missing fragments (subjects, objects, some prepositions, complements, articles, clauses, the, than, ere), some extra elements (e.g., the infinitive marker after auxiliary verbs), wrong agreement (number, case, etc), wrong verb form, and various mismatches (verb tense with adverbs, etc).</Paragraph> </Section> <Section position="5" start_page="0" end_page="0" type="metho"> <SectionTitle> 3 Gralnlnar Formalism </SectionTitle> <Paragraph position="0"> In order to handle ungrammatical expressions as well as grammaticM ones, we utilized the syntactic domain of locality implicit in categorial lexica l entries under the CCG framework, by including known variations of categorial association in the lexical entries. Multiple parts-of-speech are handled by putting all the assertions regarding grammatical parts-of-speech before those regarding ungrammatical ones.</Paragraph> </Section> <Section position="6" start_page="0" end_page="0" type="metho"> <SectionTitle> 4 Categorial Lexicon </SectionTitle> <Paragraph position="0"> In constructing a categorial lexicon for the grammar, we used an existing grammar-independent lexicon that had been made available as part of the XTAG project at the University of Pennsylvania. The original public domain lexicon contains about 37K quintuples, (INDEX, ENTRY, POS, CAT, FS), where POS and CAT are associated with a part-of-speech (such as V or N) and a set of categories, respectively, for the lexical item associated with ENTRY. The lexical items are supplemented by a morphology table, which has about 317K entries for various grammatical inflections. We have also made use of public domain word lists and consulted an on-line electronic dictionary for more commonly used lexical items.</Paragraph> <Paragraph position="1"> At the moment, the lexicon has about 18K nouns, 6.5K adjectives, 4.5K transitive verbs, 2K intransitive verbs, among other categories.</Paragraph> </Section> <Section position="7" start_page="0" end_page="0" type="metho"> <SectionTitle> 5 English Gralnlnar Checker </SectionTitle> <Paragraph position="0"> The user interface of the system includes a direct interaction with the Prolog interpreter, as well as an Internet interface. The Internet interface is written in HTML and CGI/PEt/L, which invokes the run-time image of the Prolog code.</Paragraph> <Paragraph position="1"> The project is still at an initial stage and there are many other issues that need to be addressed, such as performance and coverage of the mistake types. We believe that the use of a standard linguistic theory such as CCG made it possible to develop a grammar checker in a very short time frame and limited man power, as the existing large standard lexicons are be made readily available for it. Pending further funding, we believe that the prototype grammar checker can be brought up to an industry strength.</Paragraph> </Section> <Section position="8" start_page="0" end_page="0" type="metho"> <SectionTitle> Acknowledgements </SectionTitle> <Paragraph position="0"> We appreciate the students who agreed to provide thei~ essays for the present project. We also thank our beta testers, especially Laura Siegel and Jinah Park, who have shared their time to test the system.</Paragraph> </Section> class="xml-element"></Paper>