File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/03/p03-1014_metho.xml
Size: 29,678 bytes
Last Modified: 2025-10-06 14:08:12
<?xml version="1.0" standalone="yes"?> <Paper uid="P03-1014"> <Title>Integrated Shallow and Deep Parsing: TopP meets HPSG</Title> <Section position="3" start_page="1" end_page="1" type="metho"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> One of the strong points of deep processing (DNLP) technology such as HPSG or LFG parsers certainly lies with the high degree of precision as well as detailed linguistic analysis these systems are able to deliver. Although considerable progress has been made in the area of processing speed, DNLP systems still cannot rival shallow and medium depth technologies in terms of throughput and robustness. As a net effect, the impact of deep parsing technology on application-oriented NLP is still fairly limited.</Paragraph> <Paragraph position="1"> With the advent of XML-based hybrid shallow-deep architectures as presented in (Grover and Lascarides, 2001; Crysmann et al., 2002; Uszkoreit, 2002) it has become possible to integrate the added value of deep processing with the performance and robustness of shallow processing. So far, integration has largely focused on the lexical level, to improve upon the most urgent needs in increasing the robustness and coverage of deep parsing systems, namely lexical coverage. (This work was in part supported by a BMBF grant to the DFKI project WHITEBOARD (FKZ 01 IW 002).)</Paragraph> <Paragraph position="2"> While integration in (Grover and Lascarides, 2001) was still restricted to morphological and PoS information, (Crysmann et al., 2002) extended shallow-deep integration at the lexical level to lexico-semantic information, and named entity expressions, including multiword expressions.</Paragraph> <Paragraph position="3"> (Crysmann et al., 2002) assume a vertical, 'pipeline' scenario where shallow NLP tools provide XML annotations that are used by the DNLP system as a preprocessing and lexical interface. The perspective opened up by a multi-layered, data-centric architecture is, however, much broader, in that it encourages horizontal cross-fertilisation effects among complementary and/or competing components.</Paragraph> <Paragraph position="4"> One of the culprits for the relative inefficiency of DNLP parsers is the high degree of ambiguity found in large-scale grammars, which can often only be resolved within a larger syntactic domain. Within a hybrid shallow-deep platform one can take advantage of partial knowledge provided by shallow parsers to pre-structure the search space of the deep parser. In this paper, we will thus complement the efforts made on the lexical side by integration at the phrasal level. We will show that this may lead to a considerable performance increase for the DNLP component. More specifically, we combine a probabilistic topological field parser for German (Becker and Frank, 2002) with the HPSG parser of (Callmeier, 2000). The HPSG grammar used is the one originally developed by (Müller and Kasper, 2000), with significant performance enhancements by B. Crysmann.</Paragraph> <Paragraph position="5"> In Section 2 we discuss the mapping problem involved with syntactic integration of shallow and deep analyses and motivate our choice to combine the HPSG system with a topological parser. Section 3 outlines our basic approach towards syntactic shallow-deep integration. Section 4 introduces various confidence measures, to be used for fine-tuning of phrasal integration. 
Sections 5 and 6 report on experiments and results of integrated shallow-deep parsing, measuring the effect of various integration parameters on performance gains for the DNLP component. Section 7 concludes and discusses possible extensions, to address robustness issues.</Paragraph> </Section> <Section position="4" start_page="1" end_page="1" type="metho"> <SectionTitle> 2 Integrated Shallow and Deep Processing </SectionTitle> <Paragraph position="0"> The prime motivation for integrated shallow-deep processing is to combine the robustness and efficiency of shallow processing with the accuracy and fine-grainedness of deep processing. Shallow analyses could be used to pre-structure the search space of a deep parser, enhancing its efficiency. Even if deep analysis fails, shallow analysis could act as a guide to select partial analyses from the deep parser's chart - enhancing the robustness of deep analysis, and the informativeness of the combined system.</Paragraph> <Paragraph position="1"> In this paper, we concentrate on the usage of shallow information to increase the efficiency, and potentially the quality, of HPSG parsing. In particular, we want to use analyses delivered by an efficient shallow parser to pre-structure the search space of HPSG parsing, thereby enhancing its efficiency, and guiding deep parsing towards a best-first analysis suggested by shallow analysis constraints.</Paragraph> <Paragraph position="2"> The search space of an HPSG chart parser can be effectively constrained by external knowledge sources if these deliver compatible partial subtrees, which would then only need to be checked for compatibility with constituents derived in deep parsing. Raw constituent span information can be used to guide the parsing process by penalising constituents which are incompatible with the precomputed 'shape'. Additional information about proposed constituents, such as categorial or featural constraints, provides further criteria for prioritising compatible, and penalising incompatible constituents in the deep parser's chart.</Paragraph> <Paragraph position="3"> An obvious challenge for our approach is thus to identify suitable shallow knowledge sources that can deliver compatible constraints for HPSG parsing.</Paragraph> <Section position="1" start_page="1" end_page="1" type="sub_section"> <SectionTitle> 2.1 The Shallow-Deep Mapping Problem </SectionTitle> <Paragraph position="0"> However, chunks delivered by state-of-the-art shallow parsers are not isomorphic to deep syntactic analyses that explicitly encode phrasal embedding structures. As a consequence, the boundaries of deep grammar constituents in (1.a) cannot be pre-determined on the basis of a shallow chunk analysis (1.b). Moreover, the prevailing greedy bottom-up processing strategies applied in chunk parsing do not take into account the macro-structure of sentences.</Paragraph> <Paragraph position="1"> They are thus easily trapped in cases such as (2).</Paragraph> <Paragraph position="2"> (2) [ ... pizza and Mary] drinks wine.</Paragraph> <Paragraph position="3"> In sum, state-of-the-art chunk parsing provides neither sufficient detail nor the required accuracy to act as a 'guide' for deep syntactic analysis.</Paragraph> </Section> <Section position="2" start_page="1" end_page="1" type="sub_section"> <SectionTitle> 2.2 Stochastic Topological Parsing </SectionTitle> <Paragraph position="0"> Recently, there has been revived interest in shallow analyses that determine the clausal macro-structure of sentences. 
The topological field model of (German) syntax (Höhle, 1983) divides basic clauses into distinct fields - pre-, middle-, and post-fields - delimited by verbal or sentential markers, which constitute the left/right sentence brackets. This model of clause structure is underspecified, or partial, as to non-sentential constituent structure, but provides a theory-neutral model of sentence macro-structure.</Paragraph> <Paragraph position="1"> Due to its linguistic underpinning, the topological field model provides a pre-partitioning of complex sentences that is (i) highly compatible with deep syntactic analysis, and thus (ii) maximally effective in increasing parsing efficiency if interleaved with deep syntactic analysis; (iii) its partiality regarding the constituency of non-sentential material ensures robustness, coverage, and processing efficiency.</Paragraph> <Paragraph position="2"> (Becker and Frank, 2002) explored a corpus-based stochastic approach to topological field parsing, by training a non-lexicalised PCFG on a topological corpus derived from the NEGRA treebank of German. Measured on the basis of hand-corrected PoS-tagged input as provided by the NEGRA treebank, the parser achieves 100% coverage for sentences of length ≤ 40 (99.8% for all). Labelled precision and recall are around 93%. Perfect match (full tree identity) is about 80% (cf. Table 1, disamb +).</Paragraph> <Paragraph position="3"> In this paper, the topological parser was provided with a tagger front-end for free text processing, using the TnT tagger (Brants, 2000). The grammar was ported to the efficient LoPar parser of (Schmid, 2000). Tagging inaccuracies lead to a drop of 5.1/4.7 percentage points in LP/LR, and 8.3 percentage points in perfect match rate (Table 1, disamb −).</Paragraph> <Paragraph position="4"> (Table 1 reports disambiguation, coverage, perfect match, LP, LR, 0CB and 2CB, in %; evaluation on atomic (vs. parameterised) category labels.)</Paragraph> <Paragraph position="5"> As seen in Figure 1, the topological trees abstract away from non-sentential constituency - phrasal fields MF (middle-field) and VF (pre-field) directly expand to PoS tags. By contrast, they perfectly render the clausal skeleton and embedding structure of complex sentences. In addition, parameterised category labels encode larger syntactic contexts, or 'constructions', such as clause type (CL-V2, -SUBCL, -REL), or inflectional patterns of verbal clusters (RK-VFIN, -VPART). These properties, along with their high accuracy rate, make them perfect candidates for tight integration with deep syntactic analysis.</Paragraph> <Paragraph position="6"> Moreover, due to the combination of scrambling and discontinuous verb clusters in German syntax, a deep parser is confronted with a high degree of local ambiguity that can only be resolved at the clausal level. Highly lexicalised frameworks such as HPSG, however, do not lend themselves naturally to a top-down parsing strategy. 
Using topological analyses to guide the HPSG parser will thus provide external top-down information for bottom-up parsing.</Paragraph> </Section> </Section> <Section position="5" start_page="1" end_page="5" type="metho"> <SectionTitle> 3 TopP meets HPSG </SectionTitle> <Paragraph position="0"> Our work aims at integration of topological and HPSG parsing in a data-centric architecture, where each component acts independently - in contrast to the combination of different syntactic formalisms within a unified parsing process (as, for example, in (Duchier and Debusmann, 2001)).</Paragraph> <Paragraph position="1"> Data-based integration not only favours modularity, but also facilitates flexible and targeted dovetailing of structures.</Paragraph> <Section position="1" start_page="3" end_page="5" type="sub_section"> <SectionTitle> 3.1 Mapping Topological to HPSG Structures </SectionTitle> <Paragraph position="0"> While structurally similar, topological trees are not fully isomorphic to HPSG structures. In Figure 1, e.g., the span from the verb 'hätte' to the end of the sentence forms a constituent in the HPSG analysis, while in the topological tree the same span is dominated by a sequence of categories: LK, MF, RK, NF.</Paragraph> <Paragraph position="1"> Yet, due to its linguistic underpinning, the topological tree can be used to systematically predict key constituents in the corresponding 'target' HPSG analysis. (See Section 6 for a comparison to recent work on integrated chunk-based and dependency parsing in (Daum et al., 2003).) We know, for example, that the span from the fronted verb (LK-VFIN) to the end of its clause CL-V2 corresponds to an HPSG phrase. Also, the first position that follows this verb, here the leftmost daughter of MF, demarcates the left edge of the traditional VP. Spans of the vorfeld VF and clause categories CL exactly match HPSG constituents. Category CL-V2 tells us that we need to reckon with a fronted verb in the position of its LK daughter, here position 3, while in CL-SUBCL we expect a complementiser in the position of LK, and a finite verb within the right verbal complex RK, which spans positions 12 to 13.</Paragraph> <Paragraph position="2"> In order to communicate such structural constraints to the deep parser, we scan the topological tree for relevant configurations, and extract the span information for the target HPSG constituents. The resulting 'map constraints' (Fig. 1) encode a bracket type name that identifies the target constituent and its left and right boundary, i.e. the concrete span in the sentence under consideration. The span is encoded by the word position index in the input, which is identical for the two parsing processes.</Paragraph>
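To make the extraction step concrete, the following sketch shows how span-based map constraints of this kind could be read off a topological tree; the tree encoding, the node names, and the configurations handled are illustrative assumptions rather than the extraction rules actually used in the system.

from dataclasses import dataclass
from typing import List

@dataclass
class TopoNode:
    cat: str                  # topological category, e.g. "CL-V2", "VF", "LK-VFIN", "MF", "RK", "NF"
    start: int                # word position index of the first word covered
    end: int                  # word position index following the last word covered
    children: List["TopoNode"]

@dataclass
class MapConstraint:
    bracket_type: str         # name identifying the target HPSG constituent
    left: int                 # left boundary (word position index)
    right: int                # right boundary (word position index)

def extract_map_constraints(node: TopoNode) -> List[MapConstraint]:
    """Scan a topological tree for configurations that predict HPSG constituents."""
    constraints = []
    # Vorfeld (VF) and clause (CL-*) spans match HPSG constituents exactly.
    if node.cat == "VF" or node.cat.startswith("CL"):
        constraints.append(MapConstraint(node.cat, node.start, node.end))
    # In a verb-second clause, the span from the fronted verb (LK) to the end
    # of the clause corresponds to an HPSG phrase.
    if node.cat.startswith("CL-V2"):
        lk = next((c for c in node.children if c.cat.startswith("LK")), None)
        if lk is not None:
            constraints.append(MapConstraint("LK_TO_CLAUSE_END", lk.start, node.end))
    for child in node.children:
        constraints.extend(extract_map_constraints(child))
    return constraints

Constraints of this form can then be passed on through the XML interface described below.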
<Paragraph position="3"> In addition to pure constituency constraints, a skilled grammar writer will be able to associate specific HPSG grammar constraints - positive or negative - with these bracket types. These additional constraints will be globally defined, to permit fine-grained guidance of the parsing process. This and further information (cf. Section 4) is communicated to the deep parser by way of an XML interface.</Paragraph> </Section> <Section position="2" start_page="1" end_page="1" type="sub_section"> <SectionTitle> 3.2 Annotation-based Integration </SectionTitle> <Paragraph position="0"> In the annotation-based architecture of (Crysmann et al., 2002), XML-encoded analysis results of all components are stored in a multi-layer XML chart.</Paragraph> <Paragraph position="1"> The architecture employed in this paper improves on (Crysmann et al., 2002) by providing a central Whiteboard Annotation Transformer (WHAT) that supports flexible and powerful access to and transformation of XML annotation based on standard XSLT engines (see (Schäfer, 2003) for more details on WHAT). Shallow-deep integration is thus fully annotation driven. Complex XSLT transformations are applied to the various analyses, in order to extract or combine independent knowledge sources, including XPath access to information stored in shallow annotation, complex XSLT transformations to the output of the topological parser, and extraction of bracket constraints. (We currently extract 34 different bracket types, and we currently assume identical tokenisation, but could accommodate distinct tokenisation regimes using map tables. Advantages we see in the XSLT approach are (i) minimised programming effort in the target implementation language for XML access, (ii) reuse of transformation rules in multiple modules, and (iii) fast integration of new XML-producing components.)</Paragraph> </Section> <Section position="1" start_page="6" end_page="8" type="sub_section"> <SectionTitle> 3.3 Shaping the Deep Parser's Search Space </SectionTitle> <Paragraph position="0"> The HPSG parser is an active bidirectional chart parser which allows flexible parsing strategies by using an agenda for the parsing tasks.</Paragraph> <Paragraph position="1"> To compute priorities for the tasks, several information sources can be consulted, e.g. the estimated quality of the participating edges or external resources like PoS tagger results. Object-oriented implementation of the priority computation facilitates exchange and, moreover, combination of different ranking strategies. Extending our current regime that uses PoS tagging for prioritisation (see e.g. (Prins and van Noord, 2001) for related work), we are now utilising phrasal constraints (brackets) from topological analysis to enhance the hand-crafted parsing heuristic employed so far.</Paragraph> <Paragraph position="2"> Conditions for changing default priorities Every bracket pair br_x computed from the topological analysis comes with a bracket type x that defines its behaviour in the priority computation. Each bracket type can be associated with a set of positive and negative constraints that state a set of permissible or forbidden rules and/or feature structure configurations for the HPSG analysis.</Paragraph>
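As an illustration of how such positive and negative constraints could be attached to bracket types, the sketch below pairs each bracket type with sets of permitted and forbidden HPSG rule names; the type names, rule names, and confidence values are invented for the example and are not taken from the grammar described here.

from dataclasses import dataclass, field
from typing import Set

@dataclass
class BracketType:
    name: str                                  # e.g. "CL-V2" or "NP_CHUNK" (hypothetical)
    match: str                                 # "left", "right", or "full"
    positive_rules: Set[str] = field(default_factory=set)   # permissible HPSG rules
    negative_rules: Set[str] = field(default_factory=set)   # forbidden HPSG rules
    confidence: float = 1.0                    # conf_pr, expected accuracy of this bracket type

# Hypothetical definitions a grammar writer might supply globally:
BRACKET_TYPES = {
    "CL-V2": BracketType("CL-V2", "full", positive_rules={"head-filler-rule"}),
    "NP_CHUNK": BracketType("NP_CHUNK", "left",
                            negative_rules={"head-complement-rule"},
                            confidence=0.711),   # e.g. labelled precision of a chunk parser
}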
<Paragraph position="3"> The bracket types fall into three main categories: left-, right-, and fully matching brackets. A right-matching bracket may affect the priority of tasks whose resulting edge will end at the right bracket of a pair, like, for example, a task that would combine edges C and F or C and D in Fig. 2. Left-matching brackets work analogously. For fully matching brackets, only tasks that produce an edge that matches the span of the bracket pair can be affected, like, e.g., a task that combines edges B and C in Fig. 2. If, in addition, specified rule as well as feature structure constraints hold, the task is rewarded if they are positive constraints, and penalised if they are negative ones. All tasks that produce crossing edges, i.e. where one endpoint lies strictly inside the bracket pair and the other lies strictly outside, are penalised, e.g., a task that combines edges A and B.</Paragraph> <Paragraph position="4"> This behaviour can be implemented efficiently when we assume that the computation of a task priority (a parsing task encodes the possible combination of a passive and an active chart edge) [...] to all depending tasks without having to check the bracket conditions repeatedly. For each task, it is sufficient to examine the start- and endpoints of the building edges to determine if its priority is affected by some bracket. Only four cases can occur: 1. the new edge spans a pair of brackets: a match; 2. the new edge starts or ends at one of the brackets, but does not match: a left or right hit; 3. one bracket of a pair is at the joint of the building edges and a start- or endpoint lies strictly inside the brackets: a crossing (edges A and B in Fig. 2); 4. no bracket at the endpoints of both edges: use the default priority. For left-/right-matching brackets, a match behaves exactly like the corresponding left or right hit.</Paragraph> <Paragraph position="5"> Computing the new priority If the priority of a task is changed, the change is computed relative to the default priority. We use two alternative confidence values, and a hand-coded parameter γ(x), to adjust the impact on the default priority heuristics. conf_ent(br_x) specifies the confidence for a concrete bracket pair br_x of type x in a given sentence, based on the tree entropy of the topological parse; conf_pr specifies a measure of 'expected accuracy' for each bracket type. Sec. 4 will introduce these measures. The priority p(t) of a task t involving a bracket br_x [...]. This way of calculating priorities allows flexible parameterisation for the integration of bracket constraints.</Paragraph>
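The following sketch condenses the four cases and the confidence-weighted reward/penalty described above into code; since the exact priority formula is not given here, the arithmetic shown (scaling the default priority up or down by γ times the bracket confidence) is a plausible reading of the description rather than the original implementation, and it reuses the hypothetical BracketType sketched earlier in this section.

def classify(edge_a, edge_b, bracket):
    """Classify the edge built from adjacent edges edge_a and edge_b against one
    bracket pair; edges and brackets are (start, end) spans over word positions."""
    start, end = edge_a[0], edge_b[1]
    left, right = bracket
    if (start, end) == (left, right):
        return "match"
    if start == left:
        return "left_hit"
    if end == right:
        return "right_hit"
    joint = edge_a[1]                       # the point where the two edges meet
    if joint in (left, right) and (left < start < right or left < end < right):
        return "crossing"
    return "default"

def adjust_priority(default_priority, case, btype, gamma, conf_ent=1.0):
    """Reward or penalise a parsing task relative to its default priority.
    Checking the associated positive/negative rule constraints is omitted here."""
    conf = btype.confidence * conf_ent      # conf_pr combined with conf_ent
    if case == "crossing":
        return default_priority * (1.0 - gamma * conf)
    rewarded = (
        (btype.match == "full" and case == "match")
        or (btype.match == "left" and case in ("match", "left_hit"))
        or (btype.match == "right" and case in ("match", "right_hit"))
    )
    if rewarded:
        return default_priority * (1.0 + gamma * conf)
    return default_priority

In the deep parser, adjust_priority would be applied once, when a task is created, so that the bracket conditions need not be re-checked for depending tasks.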
</Section> </Section> <Section position="8" start_page="8" end_page="82" type="metho"> <SectionTitle> 4 Confidence Measures </SectionTitle> <Paragraph position="0"> While the topological parser's accuracy is high, we need to reckon with (partially) wrong analyses that could counter the expected performance gains. An important factor is therefore the confidence we can have, for any new sentence, in the best parse delivered by the topological parser: if confidence is high, we want it to be fully considered for prioritisation - if it is low, we want to lower its impact, or completely ignore the proposed brackets. We will experiment with two alternative confidence measures: (i) expected accuracy of particular bracket types extracted from the best parse delivered, and (ii) tree entropy based on the probability distribution encountered in a topological parse, as a measure of the overall accuracy of the best parse proposed - and thus the extracted brackets.</Paragraph> <Section position="1" start_page="9" end_page="9" type="sub_section"> <SectionTitle> 4.1 Conf_pr: Accuracy of Map-Constraints </SectionTitle> <Paragraph position="0"> To determine a measure of 'expected accuracy' for the map constraints, we computed precision and recall for the 34 bracket types by comparing the extracted brackets from the suite of best delivered topological parses against the brackets we extracted from the trees in the manually annotated evaluation corpus in (Becker and Frank, 2002). We obtain 88.3% precision and 87.8% recall for brackets extracted from the best topological parse, run with the TnT front end. We chose precision of extracted bracket types as a static confidence weight for prioritisation.</Paragraph> <Paragraph position="1"> Precision figures are distributed as follows: 26.5% of the bracket types have precision ≥ 90% (93.1% on average, 53.5% of the bracket mass), 50% have precision ≥ 80% (88.9% on average, 77.7% of the bracket mass), and 20.6% have precision ≤ 50% (41.26% on average, 2.7% of the bracket mass). For experiments using a threshold on conf_pr(x) for bracket type x, we set a threshold value of 0.7, which excludes 32.35% of the low-confidence bracket types (and 22.1% of the bracket mass), and includes chunk-based brackets (see Section 5).</Paragraph> </Section> <Section position="2" start_page="9" end_page="82" type="sub_section"> <SectionTitle> 4.2 Conf_ent: Entropy of Parse Distribution </SectionTitle> <Paragraph position="0"> While precision over bracket types is a static measure that is independent of the structural complexity of a particular sentence, tree entropy is defined as the entropy over the probability distribution of the set of parsed trees for a given sentence. It is a useful measure to assess how certain the parser is about the best analysis, e.g. to measure the training utility value of a data point in the context of sample selection (Hwa, 2000). We thus employ tree entropy as a confidence measure for the quality of the best topological parse, and the extracted bracket constraints. (Further measures are conceivable: we could extract brackets from some n-best topological parses, associating them with weights, using methods similar to (Carroll and Briscoe, 2002).) We carry out an experiment to assess the effect of varying entropy thresholds θ on precision and recall of topological parsing, in terms of perfect match rate, and show a way to determine an optimal value for θ. We compute tree entropy over the full probability distribution, and normalise the values to be distributed in a range between 0 and 1. The normalisation factor is empirically determined as the highest entropy over all sentences of the training set.</Paragraph>
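A minimal sketch of the tree-entropy confidence just described, assuming the topological parser returns a probability for every tree in the parse forest; the normalisation constant is the highest entropy observed on the training set, as stated above, and the function names are our own.

import math

def tree_entropy(tree_probs):
    """Entropy (in bits) over the renormalised probability distribution of parsed trees."""
    total = sum(tree_probs)
    return -sum((p / total) * math.log2(p / total) for p in tree_probs if p > 0)

def conf_ent(tree_probs, max_train_entropy):
    """Complement of the normalised tree entropy, clipped to the range [0, 1]."""
    h = tree_entropy(tree_probs) / max_train_entropy
    return 1.0 - min(h, 1.0)                # entropies above the training maximum are clipped

# Hypothetical usage: brackets from a parse whose confidence falls below the
# threshold 1 - theta are ignored (theta = 0.236, as determined below).
THRESHOLD = 1.0 - 0.236                     # = 0.764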
<Paragraph position="1"> Experimental setup We randomly split the manually corrected evaluation corpus of (Becker and Frank, 2002) (for sentence length ≤ 40) into a training set of 600 sentences and a test set of 408 sentences. This yields the following values for the training set (test set in brackets): initial perfect match rate is 73.5% (70.0%), LP 88.8% (87.6%), and LR 88.5% (87.8%). Coverage is 99.8% for both.</Paragraph> <Paragraph position="2"> Evaluation measures For the task of identifying the perfect matches from a set of parses we give the following standard definitions: precision is the proportion of selected parses that have a perfect match - thus being the perfect match rate - and recall is the proportion of perfect matches that the system selected. Coverage is usually defined as the proportion of attempted analyses with at least one parse. We extend this definition to treat successful analyses with a high tree entropy as being out of coverage. Fig. 3 shows the effect of decreasing entropy thresholds θ on precision, recall and coverage. The unfiltered set of all sentences is found at θ=1. Lowering θ in-[...] (Possibly higher values in the test set will be clipped to 1. Evaluation figures for this experiment are given disregarding parameterisation (and punctuation), corresponding to the first row of figures in Table 1.)</Paragraph> <Paragraph position="3"> We determine f-measure as a composite measure of precision and recall with equal weighting (α=0.5). Results We use f-measure as a target function on the training set to determine a plausible θ. F-measure is maximal at θ=0.236 with 88.9%, see Figure 4. Precision and recall are 83.7% and 94.8% resp., while coverage goes down to 83.0%. Applying the same θ on the test set, we get the following results: 80.5% precision, 93.0% recall. Coverage goes down to 80.6%. LP is 93.3%, LR is 91.2%.</Paragraph> <Paragraph position="4"> Confidence Measure We distribute the complement of the associated tree entropy of a parse tree tr as a global confidence measure over all brackets br extracted from that parse: conf_ent(br). For thresholding on conf_ent(br), we set the threshold to 1−θ = 1−0.236 = 0.764.</Paragraph> </Section> </Section> <Section position="9" start_page="82" end_page="82" type="metho"> <SectionTitle> 5 Experiments </SectionTitle> <Paragraph position="0"> Experimental Setup In the experiments we use the subset of the NEGRA corpus (5060 sentences, 24.57%) that is currently parsed by the HPSG grammar. (This test set is different from the corpus used in Section 4.) Average sentence length is 8.94, ignoring punctuation; average lexical ambiguity is 3.05 entries/word. As baseline, we performed a run without topological information, yet including PoS prioritisation from tagging. (In a comparative run without PoS prioritisation, we established a speed-up factor of 1.13 towards the baseline used in our experiment, with a slight increase in coverage (1%). This compares to a speed-up factor of 2.26 reported in (Daum et al., 2003), by integration of PoS guidance into a dependency parser.)</Paragraph> <Paragraph position="1"> A series of tests explores the effects of alternative parameter settings. We further test the impact of chunk information. To this end, phrasal fields determined by topological parsing were fed to the chunk parser of (Skut and Brants, 1998). Extracted NP and PP bracket constraints are defined as left-matching bracket types, to compensate for the non-embedding structure of chunks. 
Chunk brackets are tested in conjunction with topological brackets, and in isolation, using the labelled precision value of 71.1% in (Skut and Brants, 1998) as a uniform confidence weight.</Paragraph> <Paragraph position="2"> Measures For all runs we measure the absolute time and the number of parsing tasks needed to compute the first reading. The times in the individual runs were normalised according to the number of executed tasks per second. We noticed that the coverage of some integrated runs decreased by up to 1% of the 5060 test items, with a typical loss of around 0.5%. To warrant that we are not just trading coverage for speed, we derived two measures from the primary data: an upper bound, where we associated every unsuccessful parse with the time and number of tasks used when the limit of 70000 passive edges was hit, and a lower bound, where we removed the most expensive parses from each run, until we reached the same coverage. Whereas the upper bound is certainly more realistic in an application context, the lower bound gives us a worst-case estimate of the expectable speed-up. (The experiments were run on a 700 MHz Pentium III machine. For all runs, the maximum number of passive edges was set to the comparatively high value of 70000.)</Paragraph> <Paragraph position="3"> Integration Parameters We explored the following range of weighting parameters for prioritisation (see Section 3.3 and Table 2).</Paragraph> <Paragraph position="4"> We use two global settings for the heuristic parameter γ. Setting γ to 1/2 without using any confidence measure causes the priority of every affected parsing task to be increased or decreased by half its value. Setting γ to 1 drastically increases the influence of topological information: the priority for rewarded tasks is doubled and set to zero for penalised ones.</Paragraph> <Paragraph position="5"> The first two runs (rows with −P −E) ignore both confidence parameters (conf_pr = conf_ent = 1), measuring only the effect of higher or lower influence of topological information. In the remaining six runs, the impact of the confidence measures conf_pr and conf_ent is tested individually, namely +P −E and −P +E, by setting the respective alternative value to 1. For two runs, we set the respective confidence values that drop below a certain threshold to zero (PT, ET) to exclude uncertain candidate brackets or bracket types. For runs including chunk bracketing constraints, we chose thresholded precision (PT) as confidence weights for topological and/or chunk brackets.</Paragraph> </Section> <Section position="11" start_page="82" end_page="82" type="metho"> <SectionTitle> 6 Discussion of Results </SectionTitle> <Paragraph position="0"> Table 2 summarises the results. A high impact on bracket constraints (γ=1) results in lower performance gains than using a moderate impact (γ=1/2) (rows 2,4,5 vs. 3,8,9). A possible interpretation is that for high γ, wrong topological constraints and strong negative priorities can mislead the parser. Use of confidence weights yields the best performance gains (with γ</Paragraph> </Section> </Paper>