File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/w06-2912_abstr.xml

Size: 1,134 bytes

Last Modified: 2025-10-06 13:45:34

<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-2912">
  <Title>Unsupervised Parsing with U-DOP</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We propose a generalization of the supervised DOP model to unsupervised learning.</Paragraph>
    <Paragraph position="1"> This new model, which we call U-DOP, initially assigns all possible unlabeled binary trees to a set of sentences and next uses all subtrees from (a large subset of) these binary trees to compute the most probable parse trees. We show how U-DOP can be implemented by a PCFG-reduction technique and report competitive results on English (WSJ), German (NEGRA) and Chinese (CTB) data. To the best of our knowledge, this is the first paper which accurately bootstraps structure for Wall Street Journal sentences up to 40 words obtaining roughly the same accuracy as a binarized supervised PCFG. We show that previous approaches to unsupervised parsing have shortcomings in that they either constrain the lexical or the structural context, or both.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML