File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/01/w01-0521_abstr.xml

Size: 914 bytes

Last Modified: 2025-10-06 13:42:07

<?xml version="1.0" standalone="yes"?>
<Paper uid="W01-0521">
  <Title>Corpus Variation and Parser Performance</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> Most work in statistical parsing has focused on a single corpus: the Wall Street Journal portion of the Penn Treebank. While this has allowed for quantitative comparison of parsing techniques, it has left open the question of how other types of text might affect parser performance, and how portable parsing models are across corpora. We examine these questions by comparing results for the Brown and WSJ corpora, and also consider which parts of the parser's probability model are particularly tuned to the corpus on which it was trained. This leads us to a technique for pruning parameters to reduce the size of the parsing model.</Paragraph>
  </Section>
</Paper>