File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/98/p98-2194_metho.xml

Size: 14,664 bytes

Last Modified: 2025-10-06 14:15:02

<?xml version="1.0" standalone="yes"?>
<Paper uid="P98-2194">
  <Title>An Underspecified Segmented Discourse Representation Theory (USDRT)</Title>
  <Section position="2" start_page="0" end_page="1189" type="metho">
    <SectionTitle>
2 From DRT to SDRT
</SectionTitle>
    <Paragraph position="0"> One obvious shortcoming DRT is that it lacks the rhetorical information that structures the text. This rhetorical information, expressed by discourse relations such as narration or background, has a crucial effect on anaphora resolution, lexical disambiguation, and spatial-temporal information. SDRT extends DRT in order to amend this insufficiency.</Paragraph>
    <Paragraph position="1"> Following Asher (1996) DRSs and SDRSs will be labelled ({K1,..., Kn}). Formally, an SDRS is recursively defined as a pair of sets containing labelled DRSs or SDRSs, and the discourse relations holding between them.</Paragraph>
    <Paragraph position="2"> Definition 1 (SDRS) Let K1 : ~l,...Kn : C~n be a labelled DRSs or SDRSs and R a set of discourse relations. The tuple &lt;U, Con) is an SDRS</Paragraph>
    <Paragraph position="4"> ditions. An SDRS condition is a discourse relation suchas D(K1,... ,Kn), where D 6 R.</Paragraph>
    <Paragraph position="5"> For the basic case (i.e. (K, 0)) K labels a DRS representing the semantic context of a sentence. A discourse relation introduces furthermore a hierarchical ordering indicated by a graph representation. The nodes represent the labelled SDRSs and the edges are discourse relations. Apart from the discourse relations, which impose a hierarchical ordering, 'topic' relations add more structure to this graph. If a sentence a is the topic of another sentence/3, this is formalised as a ~ /~.l This symbol also occurs in the graph, indicating a further SDRS condition. The graph representation illustrates the hierarchical structure of the discourse and in particular the open attachment site for newly processed sentences. Basically the constituents on the so-called 'right frontier' of the discourse structure are assumed to be available for further attachment (Webber, 1991).</Paragraph>
    <Paragraph position="6"> Assuming a current label (i.e. the one added after processing the last clause/sentence), a notion of I A further SDRS condition is Focus Background Pair (FBP) which is introduced by background.</Paragraph>
    <Paragraph position="7">  D-Subordination is defined by Asher (1996, p. 24).</Paragraph>
    <Paragraph position="8"> Generally speaking, all constituents which dominate the current label are open. A further restriction is introduced by the term D-Freedom which applies to all labels which are directly dominated by a topic, unless the label assigns the current node.</Paragraph>
    <Paragraph position="9"> Formally speaking, this can be phrased as: a label K is D-free in an SDRS ~ iff current(~) = K or  ploits discourse relations to establish a hierachical ordering of discourse segments. A constituent graph indicates the dependencies between the segment, especially highlighting the open attachment points.</Paragraph>
    <Paragraph position="10"> How the discourse relations such as narration or elaboration are derived is left to an axiomatic theory called DICE (Discourse in Commonsense Entailment) that uses a non-montonic logic. Taking the reader's world knowledge and Gricean-style pragmatic maxims into account, DICE provides a formal theory of discourse attachment. The main ingredients are defaults describing laws that encode the knowledge we have about the discourse relation and discourse processing. 2 The following discourse which is similar to example (1) exemplifies how SDRT deals with anaphora resolution within a sequence of sentences (Asher, 1996): (2) (kl) After thirty months, America is back in space. (k2) The shuttle Discovery roared off the pad from Cape Kennedy at 10:38 this morning.</Paragraph>
    <Paragraph position="11">  (k3) The craft and crew performed flawlessly.</Paragraph>
    <Paragraph position="12"> (k4) Later in the day the TDRS shuttle communication satellite was sucessfully deployed.</Paragraph>
    <Paragraph position="13"> (k5) This has given a much needed boost to NASA morale.</Paragraph>
    <Paragraph position="14">  :Formally, this is expressed by means of the Comonsense Entailment (CE) (Asher and Morreau, 1991).</Paragraph>
    <Paragraph position="15"> Note that this in (k5) can refer back either to (a) the entire Shuttle voyage or (b) the launch of the TDRS satellite in (k4). It can also be shown that this cannot be linked to the start of the shuttle described in (k2). The hierachical structure of the two first sentences is established by an elaboration relation. As a consequence, the SDRS labelled by K1 is the topic of /(2 (i.e. ({K1,K2}, {elaboration(K1, K2),K1 K2})). The next sentence (k3) is a comment to the situation described in the preceding sentence. However, a new constituent K~ has to be introduced into the discourse structure. This SDRS labelled by K~ subsumes the two DRSs in K2 and K3. As a side effect, the label K2 within the discourse relation elaboration(K1,K2) is changed to the newly introduced label K~ and a further edge is introduced between this SDRS and K3. It has to  be pointed out that this modification of the entire SDRS involves an overwriting of the structure derived so far. The SDRT update function has to be designed such that these changes are accordingly incorporated. Note furthermore that the introduction of an additional edge from K~ to K3 is not assigned with a discourse relation.</Paragraph>
    <Paragraph position="16"> In order to proceed with the SDRS construction, we have to consider which constituents are available for further attachment. According to the definition of D-Freedom and D-Subordination, the SDRS labelled by K1,//'2 and K3 are still available. 3 We derive using DICE that the next sentence (k4) is connected to (k2) via narration. The resulting constituent graph is shown in figure 3. A common topic as demanded by Asher (1996, p. 28) does not occur in the graph representation. Finally, only two attachment sites are left, namely K1 and  SDRSs with the SDRS derived for (k5). Consequently, two antecedents for the anaphora this can be resolved and the theory predicts two conceivable derivations: One SDRS contains the SDRS labelled by//'5 attached to K1, whereas the second conceivable SDRS exhibits K5 connected to//'4.</Paragraph>
    <Paragraph position="17"> Summing up, the formalism includes the following shortcomings: (a) The representation of an underspecified discourse is not possible in SDRT. All readings have to be generated. (b) The formalism is not monotonic. Updates may overwrite preceeding constituents. As it can be seen from figure 2 a new SDRS K~ substituted K2. 4 (c) The constituent graph contains a set of different SDRS con' ditions (i.e. discourse relations, ~, and FBP). It is not clear how these different conditions interact and it seems difficult to predict their effect on the discourse structure. Note that the update on narration requires a common topic which connects the two SDRSs according to the axioms stipulated within SDRT. However the ~ relation is not shown in the constituent graph.</Paragraph>
    <Paragraph position="18"> I will develop further ideas introduced by under-specified semantic formalisms which have been proposed in recent years (e.g. (Reyle, 1995)) in order to provide an underspecified representation for discourse structure. I will employ a first order tree logic by Kallmeyer (1996) to define an underspecifled SDRT, in the following sections.</Paragraph>
  </Section>
  <Section position="3" start_page="1189" end_page="1190" type="metho">
    <SectionTitle>
3 Tree Descriptions
</SectionTitle>
    <Paragraph position="0"> Tree Description Grammars (TDGs) were inspired by so-called quasi-trees (Vijay-Shanker, 1992). The grammar formalism is described as a constraint-based TAG-like grammar by Kallmeyer (1996). The logic used for TDGs is a quantifier-free first order 41t may be possible that the topic relation is transitive together with the d-subordination. However, this would contradict with the definition of D-Freedom (i.e. ~3K' (K' ~1. K)) logic consisting of variables for the nodes, four binary relations and the logical connectives -% A, V. 5 Definition 2 (TDG) A Tree Description Grammar  (TDG) is a tuple G = (N,T, &lt;1, &lt;*, -.&lt;, ~, S), such that: (a) N and T are disjoint finite sets for the nonterminal and terminal symbols.</Paragraph>
    <Paragraph position="1"> (b) &lt;~ is the parent relation (i.e. immediate dominance) which is irreflexive, asymmetric and intransitive. null (c) &lt;~* is the dominance relation which is the transitive closure of ,~.</Paragraph>
    <Paragraph position="2"> (d) -.4 is the linear precedence relation which is irreflexive, asymmetric and transitive.</Paragraph>
    <Paragraph position="3"> (e) ~ is the equivalence relation which is reflexive, symmetric and transitive.</Paragraph>
    <Paragraph position="4"> (f) S is the start description.</Paragraph>
    <Paragraph position="5">  The tree descriptions are formulae in TDGs reflecting the dominance relations between subtrees. Such formulae have to be negation-free and at least one k E K must dominate all other k' E K. In order to combine two tree descriptions an adjunction operation is used which simply conjoins the two tree descriptions. Graphically, this operation can take place at the dotted lines indicating the dominance relation (i.e. &lt;~*).The straight line describes the parent relation (,~). No adjunction can take place here. Figure 4 illustrates how the labels K~x and Kt r, and  We are now able to use this tree logic to describe the hierachical ordering within SDRT. This extends 5See Kallmeyer (1996) for a detailed description of how a sound and complete notion of syntactic consequence can be defined for this logic.</Paragraph>
    <Paragraph position="6">  the original approach, as we are also able to describe ambiguous structures.</Paragraph>
  </Section>
  <Section position="4" start_page="1190" end_page="1190" type="metho">
    <SectionTitle>
4 Underspecified SDRT (USDRT)
</SectionTitle>
    <Paragraph position="0"> Similar to proposals on underspecified semantic formalisms, the SDRSs are labelled and dominance relations hold between these labels. Note that also a precedence relation is used to specify the ordering between daughter nodes.</Paragraph>
    <Paragraph position="1"> Definition 3 (USDRS) Let S be a set of DRSs, L a set of labels, R a set of discourse relations. Then U is a USDRS confined to the tuple (S, L, R) where U is a finite set consisting of the following two kinds of conditions:  Generally speaking, a discourse relation P provides the link between DRSs or SDRSs. Similar to the standard SDRT account, this relation has to be derived by considering world knowledge as well as additional discourse knowledge, and is derived within DICE. I do not consider any changes of the standard theory in this respect. The structural information, however, is encoded by the tree descriptions as introduced in section 3. The most general case describing two situations connected by a (not yet known) discourse relation is formalised as shown in figure 5. 6 The description formula for this tree is K-r &lt;~* K~I A KT1 &lt;~ Kat A KR1 &lt;1 KRI' AKm &lt;1 K~i A K~I &lt;~* sl A K~I &lt;~* s2.</Paragraph>
    <Paragraph position="2"> Comparing this representation with the SDRT constituent graph, the following similarities and differences can be observed. First of all, the question of where the open attachment sites are found is easily observable in the structural restriction given by the  tree description. Graphically, the open nodes are indicated by the dotted lines. Secondly, a topic node is introduced, immediately dominating the discourse segment. No distinction between D-Subordination and D-Freedom has to be made, because the topic is open for further attachment as well. This is the main change to the discourse structure proposed by Schilder (1997). This account encodes the topic information in an additional feature called PROM1.</Paragraph>
    <Paragraph position="3"> However, it gives no formal definition of this term. I stick therefore to the topic definition Asher gives. But instead a uniform treatment of the hierarchical ordering can be given by the tree logic used. Thirdly, the discourse segment is dominated by the discourse relation that possesses two daughter nodes. The structure is flexible enough to allow further attachment here. No overwriting of a derived structure, as for the SDRT account, is necessary.</Paragraph>
    <Paragraph position="4"> If a discourse relation is derived, further constraints are imposed on the discourse structure. Basically, two cases can be distinguished: (a) A subordinating structure is triggered by discourse relations like narration or result. Consequently, the second situation becomes the topic (i.e. K~I : /3) and the precedence relation between K~I and K~I is introduced. In addition, the open attachment site on the right frontier gets closed (i.e. K~ 1 ~ K2). (b) A subordinated structure which comes with discourse relations like elaboration or background contains the first situation as a topic (i.e. K~I : a). For this structure a precedence relation between K~I and K~I also holds, but instead of the right frontier, the left frontier is closed (i.e. K~ 1 ~ K1). Generally speaking, the analysis proposed for (2) follows the SDRT account, especially regarding the derivation of the discourse relations. The first two sentences are connected via elaboration. However, the analysis differs with respect to the obtained discourse structure. Since sentence (kl) (i.e. the semantic content a) is the topic of this text segment  (i.e. (kl) and (k2)), a copy of a ends up in KT1 .</Paragraph>
    <Paragraph position="5"> The resulting tree description contains two node pairs where the dominance relation holds, indicated by the dotted line in the graphical representation.</Paragraph>
    <Paragraph position="6"> Hence there are two possible attachment sites. 7 The construction of the discourse sequence continues in the same way until sentence (k5). The ambiguity for this can be expressed as illustrated in figure 6. Sentence (k5) (i.e. 8s : ~) is connected via result with either K~I : o~ (i.e. this refers to the entire voyage in (kl)) or KT3 (i.e. only the launch of the satellite is referred to by this). Note furthermore that the latter reading requires that (k5) is an elaboration of (kl). Thus the USDRT analysis provides an underspecified representation of the discourse structure which covers the two possible readings of (2).</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML