File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/95/e95-1015_abstr.xml
Size: 1,662 bytes
Last Modified: 2025-10-06 13:48:24
<?xml version="1.0" standalone="yes"?> <Paper uid="E95-1015"> <Title>VP VB NP NP PP PP IN NP VP VBG NN</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We deal with the question as to whether there exists a polynomial time algorithm for computing the most probable parse tree of a sentence generated by a data-oriented parsing (DOP) model. (Scha, 1990; Bod, 1992, 1993a). Therefore we describe DOP as a stochastic tree-substitution grammar (STSG). In STSG, a tree can be generated by exponentially many derivations involving different elementary trees. The probability of a tree is equal to the sum of the probabilities of all its derivations.</Paragraph> <Paragraph position="1"> We show that in STSG, in contrast with stochastic context-free grammar, the Viterbi algorithm cannot be used for computing a most probable tree of a string. We propose a simple modification of Viterbi which allows by means of a &quot;select-random&quot; search to estimate the most probable tree of a string in polynomial time.</Paragraph> <Paragraph position="2"> Experiments with DOP on ATIS show that only in 68% of the cases, the most probable derivation of a string generates the most probable tree of that string. Therefore, the parse accuracy obtained by the most probable trees (96%) is dramatically higher than the parse accuracy obtained by the most probable derivations (65%).</Paragraph> <Paragraph position="3"> It is still an open question whether the most probable tree of a string can be deterministically computed in polynomial time.</Paragraph> </Section> class="xml-element"></Paper>