File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/03/p03-2033_intro.xml

Size: 3,322 bytes

Last Modified: 2025-10-06 14:01:49

<?xml version="1.0" standalone="yes"?>
<Paper uid="P03-2033">
  <Title>A Debug Tool for Practical Grammar Development</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> There is an increasing need for syntactical parsers for practical usages, such as information extraction. For example, Yakushiji et al. (2001) extracted argument structures from biomedical papers using a parser based on XHPSG (Tateisi et al., 1998), which is a large-scale HPSG. Although large-scale and general-purpose grammars have been developed, they have a problem of limited coverage.</Paragraph>
    <Paragraph position="1"> The limits are derived from deficiencies of grammars themselves. For example, XHPSG cannot treat coordinations of verbs (ex. &amp;quot;Molybdate slowed but did not prevent the conversion.&amp;quot;) nor reduced relatives (ex. &amp;quot;Rb mutants derived from patients with retinoblastoma.&amp;quot;). Finding these grammar defects and modifying them require tremendous human effort. null Hence, we have developed willex that helps to improve the general-purpose grammars. Willex has two major functions. First, it reduces a human workload to improve the general-purpose grammar through using language intuition encoded in syntactically tagged corpora in XML format. Second, it records data of grammar defects to allow developers to have a whole picture of parsing errors found in the target corpora to save debugging time and effort by prioritizing them.</Paragraph>
    <Paragraph position="2"> 2 What Is the Ideal Grammar Debugging? There are already other grammar developing tools, such as a grammar writer of XTAG (Paroubek et al., 1992), ALEP (Schmidt et al., 1996), ConTroll (G&amp;quot;otz and Meurers, 1997), a tool by Nara Institute of Science and Technology (Miyata et al., 1999), and [incr tsdb()] (Oepen et al., 2002). But these tools have following problems; they largely depend on human debuggers' language intuition, they do not help users to handle large amount of parsing results effectively, and they let human debuggers correct the bugs one after another manually and locally.</Paragraph>
    <Paragraph position="3"> To cope with these shortcomings, willex proposes an alternative method for more efficient debugging process.</Paragraph>
    <Paragraph position="4"> The workflow of the conventional grammar developing tools and willex are different in the following ways. With the conventional tools, human debuggers must check each sentence to find out grammar defects and modify them one by one. On the other hand, with willex human debuggers check sentences that are tagged with syntactical structure, one by one, find grammar defects, and record them, while willex collects the whole grammar defect records.</Paragraph>
    <Paragraph position="5"> Then human debuggers modify the found grammar defects. This process allows human debuggers to make priority over defects that appear more frequently in the corpora, or defects that are more critical for purposes of syntactical parsing. Indeed, it is possible for human debuggers using the conventional tools to collect and modify the defects but willex saves the trouble of human debuggers to collect defects to modify them more efficiently.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML