<?xml version="1.0" standalone="yes"?>
<Paper uid="H01-1014">
  <Title>Converting Dependency Structures to Phrase Structures</Title>
  <Section position="2" start_page="0" end_page="1" type="metho">
    <SectionTitle>
2. CONVERTING PHRASE STRUCTURES
TO DEPENDENCY STRUCTURES
</SectionTitle>
    <Paragraph position="0"> The notion of head is important in both phrase structures and dependency structures. In many linguistic theories such as X-bar theory and GB theory, each phrase structure has a head that determines the main properties of the phrase and a head has several levels of projections; whereas in a dependency structure the head is linked to its dependents. In practice, the head information is explicitly marked in a dependency Treebank, but not always so in a phrase-structure Treebank. A common way to find the head in a phrase structure is to use a head percolation table, as discussed in [7, 1] among others. For example, the entry (S right S/VP) in the head percolation table says that the head child  of an S node is the first child of the node from the right with the label S or VP. Once the heads in phrase structures are found, the conversion from phrase structures to dependency structures is straightforward, as shown below: (a) Mark the head child of each node in a phrase structure, using the head percolation table.</Paragraph>
    <Paragraph position="1">  The head-child of a node XP is the child of the node XP that is the ancestor of the head of the XP in the phrase structure. .</Paragraph>
    <Paragraph position="2"> (b) In the dependency structure, make the head of each non-head-child depend on the head of the head-child.</Paragraph>
    <Paragraph position="3"> Figure 1 shows a phrase structure in the English Penn Treebank [8]. In addition to the syntactic labels (such as NP for a noun phrase), the Treebank also uses function tags (such as SBJ for the subject) for grammatical functions. In this phrase structure, the root node has two children: the NP and the VP. The algorithm would choose the VP as the head-child and the NP as a non-head-child, and make the head Vinkin of the NP depend on the head join of the VP in the dependency structure. The dependency structure of the sentence is shown in Figure 2. A more sophisticated version of the algorithm (as discussed in [10]) takes two additional tables (namely, the argument table and the tagset table) as input and produces dependency structures with the argument/adjunct distinction (i.e., each dependent is marked in the dependency structure as either an argument or an adjunct of the head).</Paragraph>
    <Paragraph position="4">  are marked as parents of their dependents in an ordered tree. It is worth noting that quite often there is no consensus on what the correct dependency structure for a particular sentence should be. To build a dependency Treebank, the Treebank annotators must decide which word depends on which word; for example, they have to decide whether the subject Vinken in Figure 1 depends on the modal verb will or the main verb join. In contrast, the annotators for phrase-structure Treebanks do not have to make such decisions. The users of phrase-structure Treebanks can modify the head percolation tables to get different dependency structures from the same phrase structure. In other words, phrase structures offer more flexibility than dependency structures with respect to the choices of heads.</Paragraph>
    <Paragraph position="5"> The feasibility of using the head percolation table to identify the heads in phrase structures depends on the characteristics of the language, the Treebank schema, and the definition of the correct dependency structure. For instance, the head percolation table for a strictly head-final (or head-initial) language is very easy to build, and the conversion algorithm works very well. For the English Penn Treebank, which we used in this paper, the conversion algorithm works very well except for the noun phrases with the appositive construction. For example, the conversion algorithm would choose the appositive the CEO of FNX as the head child of the phrase John Smith, the CEO of FNX, whereas the correct head child should be John Smith.</Paragraph>
  </Section>
  <Section position="3" start_page="1" end_page="6" type="metho">
    <SectionTitle>
3. CONVERTING DEPENDENCY STRUC-
TURES TO PHRASE STRUCTURES
</SectionTitle>
    <Paragraph position="0"> The main information that is present in phrase structures but not in dependency structures is the type of syntactic category (e.g., NP, VP, and S); therefore, to recover syntactic categories, any algorithm that converts dependency structures to phrase structures needs to address the following questions: Projections for each category: for a category X, what kind of projections can X have? Projection levels for dependents: Given a category Y depends on a category X in a dependency structure, how far should Y project before it attaches to X's projection? Attachment positions: Given a category Y depends on a category X in a dependency structure, to what position on X's projection chain should Y's projection attach? In this section, we discuss three conversion algorithms, each of which gives different answers to these three questions. To make the comparison easy, we shall apply each algorithm to the dependency structure (d-tree) in Figure 2 and compare the output of the algorithm with the phrase structure for that sentence in the English Penn Treebank, as in Figure 1.</Paragraph>
    <Paragraph position="1"> Evaluating these algorithms is tricky because just like dependency structures there is often no consensus on what the correct phrase structure for a sentence should be. In this paper, we measure the performance of the algorithms by comparing their output with an existing phrase-structure Treebank (namely, the English Penn Treebank) because of the following reasons: first, the Tree-bank is available to the public, and provides an objective although imperfect standard; second, one goal of the conversion algorithms is to make it possible to compare the performance of parsers that produce dependency structures with the ones that produce phrase structures. Since most state-of-the-art phrase-structure parsers are evaluated against an existing Treebank, we want to evaluate the conversion algorithms in the same way; third, a potential application of the conversion algorithms is to help construct a phrase-structure Treebank for one language, given parallel corpora and the phrase structures in the other language. One way to evaluate the quality of the resulting Treebank is to compare it with an existing Treebank.</Paragraph>
    <Section position="1" start_page="1" end_page="1" type="sub_section">
      <SectionTitle>
3.1 Algorithm 1
</SectionTitle>
      <Paragraph position="0"> According to X-bar theory, a category X projects to X', which further projects to XP. There are three types of rules, as shown in Figure 3(a). Algorithm 1, as adopted in [4, 3], strictly follows X-bar theory and uses the following heuristic rules to build phrase structures:</Paragraph>
      <Paragraph position="2"> (c) phrase structure (Arg) (Mod)(Spec) (1) XP -&gt; YP X' (2) X' -&gt; X' WP (3) X' -&gt; X ZP (a) rules in X-bar theory (b) d-tree  based on it) Two levels of projections for any category: any category X has two levels of projection: X' and XP. Maximal projections for dependents: a dependent Y always projects to Y' then YP, and the YP attaches to the head's projection. Fixed positions of attachment: Dependents are divided into three types: specifiers, modifiers, and arguments. Each type of dependent attaches to a fixed position, as shown in Figure 3(c). The algorithm would convert the d-tree in Figure 3(b) to the phrase structure in Figure 3(c). If a head has multiple modifiers, the algorithm could use either a single X' or stacked X' [3]. Figure 4 shows the phrase structure for the d-tree in Figure 2, where the algorithm uses a single X' for multiple modifiers of the same head.  tree in Figure 2</Paragraph>
    </Section>
    <Section position="2" start_page="1" end_page="3" type="sub_section">
      <SectionTitle>
3.2 Algorithm 2
</SectionTitle>
      <Paragraph position="0"> Algorithm 2, as adopted by Collins and his colleagues [2] when they converted the Czech dependency Treebank [6] into a phrase-structure Treebank, produces phrase structures that are as flat as possible. It uses the following heuristic rules to build phrase structures: null One level of projection for any category: X has only one level of projection: XP.</Paragraph>
      <Paragraph position="1">  To make the phrase structure more readable, we use N' and NP as the X' and XP for all kinds of POS tags for nouns (e.g., NNP, NN, and CD). Verbs and adjectives are treated similarly.</Paragraph>
      <Paragraph position="2"> Minimal projections for dependents: A dependent Y does not project to YPunless it has its own dependents.</Paragraph>
      <Paragraph position="3"> Fixed position of attachment: A dependent is a sister of its head in the phrase structure.</Paragraph>
      <Paragraph position="4">  The algorithm treats all kinds of dependents equally. It converts the pattern in Figure 5(a) to the phrase structure in Figure 5(b). Notice that in Figure 5(b), Y does not project to YP because it does not have its own dependents. The resulting phrase structure for the d-tree in Figure 2 is in Figure 6, which is much flatter than the one produced by Algorithm 1.</Paragraph>
    </Section>
    <Section position="3" start_page="3" end_page="5" type="sub_section">
      <SectionTitle>
3.3 Algorithm 3
</SectionTitle>
      <Paragraph position="0"> The previous two algorithms are linguistically sound. They do not use any language-specific information, and as a result there are several major differences between the output of the algorithms and the phrase structures in an existing Treebank, such as the Penn English Treebank (PTB).</Paragraph>
      <Paragraph position="1"> Projections for each category: Both algorithms assume that the numbers of projections for all the categories are the same, whereas in the PTB the number of projections varies from head to head. For example, in the PTB, determiners do not project, adverbs project only one level to adverbial phrases, whereas verbs project to VP, then to S, then to SBAR.</Paragraph>
      <Paragraph position="2">  Projection levels for dependents: Algorithm 1 assumes the maximal projections for all the dependents, while Algorithm 2 assumes minimal projections; but in the PTB, the level of projection of a dependent may depend on several factors such as the categories of the dependent and the head, the position of the dependent with respect to the head, and the dependency type. For example, when a  If a dependent Y has its own dependents, it projects to YP and YP is a sister of the head X; otherwise, Y is a sister of the head X.  S is similar to IP (IP is the maximal projection of INFL) in GB theory, so is SBAR to CP (CP is the maximal projection of Comp); therefore, it could be argued that only VP is a projection of verbs in the PTB. Nevertheless, because PTB does not mark INFL and Comp, we treat S and SBAR as projections of verbs.</Paragraph>
      <Paragraph position="3"> noun modifies a verb (or VP) such as yesterday in he came yesterday, the noun always projects to NP, but when a noun N  Attachment positions: Both algorithms assume that all the dependents of the same dependency type attach at the same level (e.g., in Algorithm 1, modifiers are sisters of X', while in Algorithm 2, modifiers are sisters of X); but in the PTB, that is not always true. For example, an ADVP, which depends on a verb, may attach to either an S or a VP in the phrase structure according to the position of the ADVP with respect to the verb and the subject of the verb. Also, in noun phrases, left modifiers (e.g., JJ) are sisters of the head noun, while the right modifiers (e.g., PP) are sisters of NP.</Paragraph>
      <Paragraph position="4"> For some applications, these differences between the Treebank and the output of the conversion algorithms may not matter much, and by no means are we implying that an existing Treebank provides the gold standard for what the phrase structures should be. Nevertheless, because the goal of this work is to provide an algorithm that has the flexibility to produce phrase structures that are as close to the ones in an existing Treebank as possible, we propose a new algorithm with such flexibility. The algorithm distinguishes two types of dependents: arguments and modifiers. The algorithm also makes use of language-specific information in the form of three tables: the projection table, the argument table, and the modification table. The projection table specifies the projections for each category. The argument table (the modification table, resp.) lists the types of arguments (modifiers, resp) that a head can take and their positions with respect to the head. For example, the entry V ! VP ! S in the projection table says that a verb can project to a verb phrase, which in turn projects to a sentence; the entry (P 0 1 NP/S) in the argument table indicates that a preposition can take an argument that is either an NP or an S, and the argument is to the right of the preposition; the entry (NP DT/JJ PP/S) in the modification table says that an NP can be modified by a determiner and/or an adjective from the left, and by a preposition phrase or a sentence from the right.</Paragraph>
      <Paragraph position="5"> Given these tables, we use the following heuristic rules to build phrase structures:  One projection chain per category: Each category has a unique projection chain, as specified in the projection table.</Paragraph>
      <Paragraph position="6"> Minimal projection for dependents: A category projects to a higher level only when necessary.</Paragraph>
      <Paragraph position="7"> Lowest attachment position: The projection of a dependent attaches to a projection of its head as lowly as possible.</Paragraph>
      <Paragraph position="8"> The last two rules require further explanation, as illustrated in Figure 7. In the figure, the node X has three dependents: Y and Z are arguments, and W is a modifier of X. Let's assume that the algorithm has built the phrase structure for each dependent. To form the phrase structure for the whole d-tree, the algorithm needs to attach the phrase structures for dependents to the projection chain</Paragraph>
      <Paragraph position="10"> of the head X. For an argument such as Z, suppose its projection chain is Z  and the root of the phrase structure headed by Z is Z</Paragraph>
      <Paragraph position="12"> in the phrase structure. Notice that based on the second heuristic rule (i.e., minimal projection for dependents), Z</Paragraph>
      <Paragraph position="14"> In theory, the last two heuristic rules may conflict each other in some cases. In those cases, we prefer the third rule over the second.</Paragraph>
      <Paragraph position="15"> In practice, such conflicting cases are very rare, if exist.</Paragraph>
      <Paragraph position="16"> this case although Z  The phrase structure produced by Algorithm 3 for the d-tree in Figure 2 is in Figure 8. In Figure 8, (a)-(e) are the phrase structures for five dependents of the head join; (f) is the projection chain for the head. The arrows indicate the positions of the attachment. Notice that to attach (a) to (f), the NNP Vinken needs to further project to NP because according to the argument table, a VP can take an NP, but not an NNP, as its argument.</Paragraph>
      <Paragraph position="17"> In the PTB, a modifier either sister-adjoins or Chomsky-adjoins to the modifiee. For example, in Figure 1, the MD will Chomsky-adjoins whereas the NP Nov. 29 sister-adjoins to the VP node. To account for that, we distinguish these two types of modifiers in the modification table and Algorithm 3 is extended so that it would attach Chomsky-adjoining modifiers higher by inserting extra nodes. To convert the d-tree in Figure 2, the algorithm inserts an extra VP node in the phrase structure in Figure 8 and attaches the MD will to the new VP node; the final phrase structure produced by the algorithm is identical to the one in Figure 1.</Paragraph>
    </Section>
    <Section position="4" start_page="5" end_page="6" type="sub_section">
      <SectionTitle>
3.4 Algorithm 1 and 2 as special cases of Algorithm 3
</SectionTitle>
      <Paragraph position="1"> Although the three algorithms adopt different heuristic rules to build phrase structures, the first two algorithms are special cases of the last algorithm; that is, we can design a distinct set of projection/argument/modification tables for each of the first two algorithms so that running Algorithm 3 with the associated set of tables for Algorithm 1 (Algorithm 2, respectively) would produce the same results as running Algorithm 1 (Algorithm 2, respectively). For example, to produce the results of Algorithm 2 with the code for Algorithm 3, the three tables should be created as follows:  (a) In the projection table, each head X has only one projection XP; (b) In the argument table, if a category Y can be an argument of a category X in a d-tree, then include both Y and YP as arguments of X; (c) In the modification table, if a category Y can be a modifier  of a category X in a d-tree, then include both Y and YP as sistermodifiers of XP.</Paragraph>
    </Section>
  </Section>
class="xml-element"></Paper>