File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/04/w04-0210_metho.xml

Size: 30,372 bytes

Last Modified: 2025-10-06 14:09:03

<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-0210">
  <Title>Discourse Annotation and Semantic Annotation in the GNOME Corpus</Title>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
3 Identifying Utterances
</SectionTitle>
    <Paragraph position="0"> In order to use a corpus to study salience, it is essential to find a way to annotate what in Center- null of the collection of all patient leaflets in the UK which was digitized to support the ICONOCLAST project at the University of Brighton (Scott et al., 1998).</Paragraph>
    <Paragraph position="1"> ing theory (Grosz et al., 1995) are called UTTER-ANCES, i.e., the units of text after which the local focus is updated. In most annotations concerned with salience, a predefined notion of utterance was adopted, typically sentences (Miltsakaki, 2002) or (finite) clauses (Kameyama, 1998). This approach, however, precludes using the corpus to compare possible definitions of utterance, one of the goals of the GNOME annotation (Poesio et al., 2004b).</Paragraph>
    <Paragraph position="2"> In order to do this, we marked all spans of text that might be claimed to update the local focus, including sentences (defined as all units of text ending with a full stop, a question mark, or an exclamation point) as well as what we called (DISCOURSE) UNITS. Units include clauses (defined as sequences of text containing a verbal complex, all its obligatory arguments, and all postverbal adjuncts) as well as other sentence subconstituents that might be viewed as independently updating the local focus, such as parentheticals, preposed PPs, and (the  second element of) coordinated VPs. Examples of clauses, verbal and non-verbal parentheticals, and preposed PPs marked as units follow; the parentheses indicate unit boundaries. (Sentence boundaries are not indicated.) (1) a. clausal unit with non-verbal parenthetical: (It's made in the shape of a real object (- a violin)) b. clausal unit with preposed PP and em null bedded relative clauses: ((With the development of heraldry in the later Middle Ages in Europe as a means of identification), all (who were entitled (to bear arms)) wore signet-rings (engraved with their armorial bearings)) As example (1b) above illustrates, subordinate units such as clausal complements and relative clauses were enclosed within the superordinate unit. Subordinate units also include adjunct clauses headed by connectives such as before, after, because and clauses in subject position. In total, the texts used for the main study contain 505 sentences and more than 1,000 units, including 900 finite clauses.3 Sentence and Unit Attributes Sentences have one attribute, STYPE, specifying whether the sentence is declarative, interrogative, imperative, or exclamative. The attributes of units include: * UTYPE: whether the unit is a main clause, a relative clause, appositive, a parenthet3Our instructions for marking up such elements benefited from the discussion of clauses in (Quirk and Greenbaum, 1973) and Marcu's proposals for discourse units annotation (1999). ical, etc. The possible values for this attribute are main, relative, such-as, appositive, parenthetical, paren-rel, paren-app, parenmain, subject, complement, adjunct, coord-vp,preposed-pp, listitem, cleft, title, disc-marker.</Paragraph>
    <Paragraph position="3"> * VERBED: whether the unit contains a verb.</Paragraph>
    <Paragraph position="4"> * FINITE: for verbed units, whether the verb is finite or not.</Paragraph>
    <Paragraph position="5"> * SUBJECT: for verbed units, whether they have a full subject, an empty subject (expletive, as in there sentences), or no subject (e.g., for infinitival clauses).</Paragraph>
    <Paragraph position="6"> Annotation Issues Marking up sentences proved to be quite easy; marking up units, on the other hand, required extensive annotator training. The agreement on identifying the boundaries of units, using the k statistic discussed in (Carletta, 1996), was k = .9 (for two annotators and 500 units); the agreement on features (2 annotators and at least 200 units) was as follows: UTYPE: k=.76; VERBED: k=.9; FINITE: k=.81. The main problems when marking units were to identify complements, to distinguish clausal adjuncts from prepositional phrases, and how to mark up coordinated units. The main problem with complements was to distinguish non-finite complements of verbs such as want from the non-finite part of verbal complexes containing modal auxiliaries such as get, let, make, and have: (2) a. (I would like (to be able to travel)) b. (I let him do his homework) One problem that proved fairly difficult to handle (and which, in fact, we didn't entirely solve) was clausal coordination. The problem was to preserve enough structure to be able to compute the previous utterance, while preserving some basic intuitions about what constitutes a clause (roughly, that by and large clauses were text spans marked either by the presence of a semantically isolated verb or by punctuation / layout) which are essential for annotators and are needed to specify the values of attributes. This was relatively easy to do when two main clauses were coordinated; coordinated main clauses were marked as in (3a). However, it wasn't completely obvious what to do in the case of coordination within a subordinate clause, as in (3b). Because there weren't many such cases, rather than using the &lt;unit&gt; element with a special value for UTYPE as we did for coordinated NPs (which meant specifying all sorts of special values for attributes) we used a markup element called &lt;unit-coordination&gt; to maintain the structure, and then marked up each clause separately, as shown in (3c) (the &lt;unit-coordination&gt; is marked with square brackets).</Paragraph>
    <Paragraph position="7">  (3) a. (The Getty museum's microscope still  works,) (and the case is fitted with a drawer filled with the necessary attachments). null b. (If you have any questions or are not sure about anything, ask your doctor or your pharmacist) c. ((If [(you have any questions) or (you are not sure about anything)]), ask your doctor or your pharmacist) The elements of text not marked up as units include: NPs, post-verbal and post-nominal PPs, non- null verbal NP modifiers, coordinated VPs in case the second conjunct did not have arguments (4a), and quoted parts of text, when not reported speech (4b). (4) a. (The oestradiol and norethisterone acetate are plant derived and synthetically produced) null b. (The inscription 'CHNETOC</Paragraph>
  </Section>
  <Section position="5" start_page="0" end_page="0" type="metho">
    <SectionTitle>
BASHLHKOC CPATHARHC')
</SectionTitle>
    <Paragraph position="0"> Layout Our genres raised a few issues that, as far as we know, have not been previously discussed in the Centering literature. One such problem is what to do with layout elements such as titles and list elements, which can clearly serve as the first introduction of a CF and to move the CB. One example of title unit is unit (u1) in (5).</Paragraph>
    <Paragraph position="1"> (5) (u1) Side effects Side effects may occur when PRODUCT-Y is applied to large parts of the body, We marked these layout elements as units, as in (6), but using the special value title of the attribute  UTYPE (see above) so that we could test whether it was better to treat them as utterances or not.</Paragraph>
    <Paragraph position="2"> (6) &lt;unit id=&amp;quot;u1&amp;quot; utype=&amp;quot;title&amp;quot;&gt;Side  effects&lt;/unit&gt; &lt;p&gt;&lt;s stype=&amp;quot;decl&amp;quot;&gt;&lt;unit&gt; Side effects may occur &lt;unit&gt;when PRODUCT-Y is applied to large parts of the body, ... &lt;/unit&gt; ... &lt;/unit&gt; ... &lt;/s&gt; ... &lt;/p&gt; Problems with Attributes The most difficult at- null tribute to mark was UTYPE, and our main problem was to distinguish between relative clauses and parentheticals, since it's not always easy to tell whether a relative clause is restrictive or non-restrictive (see also (Cheng et al., 2001)). In the end, we adopted rules purely based on surface form (the presence or absence of a comma or other bracketing device). (See also (Quirk and Greenbaum, 1973).) Utterances and Propositions The annotation of units has been shown useful to identify many of the atomic propositions expressed by a text, and was therefore used as a basis for studying text planning (Karamanis, 2003) and aggregation (Cheng, 2001).</Paragraph>
  </Section>
  <Section position="6" start_page="0" end_page="0" type="metho">
    <SectionTitle>
4 Properties of Discourse Entities and
</SectionTitle>
    <Paragraph position="0"> their Realization The main goal of the GNOME annotation was to study the factors that affect the realization of discourse entities, focusing on those entities realized as NPs. Hence, our main concern was to identify and to annotate the relevant properties both of discourse entities themselves and their realizations in a particular utterance (which we will call FORWARD LOOK-ING CENTERS, or CFs, following Centering's terminology). Both types of properties were annotated as properties of the &lt;ne&gt; element, used to mark up NPs in the corpus. Overall, we annotated 14 attributes of &lt;ne&gt; elements, specifying the syntactic and semantic properties of NPs and the semantic properties of the discourse entities they realize. We discuss these attributes in this section. We also annotated semantic relations between discourse entities, particularly when they express anaphoric relations. Anaphoric annotation is discussed in the next section.</Paragraph>
    <Section position="1" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
4.1 Marking up NEs
</SectionTitle>
      <Paragraph position="0"> The &lt;ne&gt; element is used to mark NPs, as in the following example (the attributes will be discussed  &lt;ne id=&amp;quot;ne4&amp;quot; cat=&amp;quot;bare-np&amp;quot; per=&amp;quot;per3&amp;quot; num=&amp;quot;plur&amp;quot; gen=&amp;quot;neut&amp;quot; gf=&amp;quot;obj&amp;quot; lftype=&amp;quot;term&amp;quot; onto=&amp;quot;person&amp;quot; ani=&amp;quot;animate&amp;quot; deix=&amp;quot;deix-no&amp;quot; count=&amp;quot;count-yes&amp;quot; structure=&amp;quot;set&amp;quot; generic=&amp;quot;generic-yes&amp;quot; loeb=&amp;quot;sort&amp;quot;&gt; scholars &lt;/ne&gt; &lt;unit finite='finite-no' id='u4' utype='complement' verbed='verbed-yes'&gt; to link &lt;ne id=&amp;quot;ne5&amp;quot; cat=&amp;quot;pers-pro&amp;quot; per=&amp;quot;per3&amp;quot; num=&amp;quot;sing&amp;quot; gen=&amp;quot;neut&amp;quot; gf=&amp;quot;obj&amp;quot; lftype=&amp;quot;term&amp;quot; onto=&amp;quot;concrete&amp;quot; ani=&amp;quot;inanimate&amp;quot; deix=&amp;quot;deix-yes&amp;quot; count=&amp;quot;count-yes&amp;quot; structure=&amp;quot;atom&amp;quot; generic=&amp;quot;generic-no&amp;quot; loeb=&amp;quot;disc-function&amp;quot;&gt; it &lt;/ne&gt; ...</Paragraph>
      <Paragraph position="1">  The GNOME instructions for identifying NPs derive from those proposed in MATE (Poesio et al., 1999), in turn derived from DRAMA (Passonneau, 1997) and MUC-7 (Hirschman, 1998). An important difference between the instructions used for GNOME and those developed for MATE is that instead of attempting to get the annotators to recognize the NP that realize discourse entities and only mark those, in GNOME all NPs were marked with &lt;ne&gt; elements; the separate LF TYPE attribute was used to distinguish between NPs with different types of denotations (see below). This change made the process of identifying nominal entities easier and potentially automatic (even though the identification of markables was still done by hand).</Paragraph>
      <Paragraph position="2"> As in the case of units, the main problem with marking up NPs was coordination. Our approach was to use a separate &lt;ne&gt; element to mark up the coordinated NP, with type (CAT) value coord-np.</Paragraph>
      <Paragraph position="3"> We only used a coord-np element if two determiners were present, as in ((your doctor) and (your pharmacist)). This approach was chosen because it limited the number of spurious coordinations introduced (in cases such as this is an interesting and well-known example of early Byzantine jewellery), but has the limitation that only one &lt;ne&gt; is marked in cases such as Your doctor or pharmacist.</Paragraph>
    </Section>
    <Section position="2" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
4.2 Properties of all NPs
</SectionTitle>
      <Paragraph position="0"> Some of the attributes of &lt;ne&gt; elements specify properties of all NPs, whether or not they realize a discourse entity. We discuss these first.</Paragraph>
      <Paragraph position="1"> CAT The CAT attribute is used to mark NP type: whether the NP is a pronoun, a definite description, etc.. This attribute is only meant to provide a very surface-y classification, without attempting to group NPs in larger classes such as 'definite NP' and 'indefinite NP'. The one attempt to go beyond pure surface was the introduction of a distinction between definite descriptions that are really disguised proper names such as the Beatles, classified as CAT=the-pn, and all other definite descriptions, classified as the-np. The complete list of values for CAT is: a-np, another-np, q-np, num-np, meas-np, that-np, this-np, such-np, wh-np, poss-np, bare-np, pn, the-pn, the-np, pers-pro, poss-pro, refl-pro, rec-pro, q-pro, wh-pro, this-pro, that-pro, num-ana (for 'numerical anaphors' such as one in I want one), null-ana, gerund (for nominalized present participles such as veneering furniture in the practice of veneering furniture), coord-np, and free-rel (for 'free relatives' such as what you need most in what you need most is a good rest)).</Paragraph>
      <Paragraph position="2"> The agreement on this attribute was pretty high, k = .9; the one problem was the distinction between the-pn and the-np.</Paragraph>
      <Paragraph position="3"> Agreement features: NUM, PER, and GEN These atributes are used to annotate features that are important to study pronoun interpretation: gender, number and person of NPs. Person and number were generally easy to annotate, but gender was very difficult because of the presence of many references to individual of unspecified gender, such as the maker in the inventory gives neither the name of the maker nor the location. This problem was solved by introducing a special undersp-gen value; indeed, underspecified values were provided for all attributes. The agreement values for these features were: GEN: k = .89; NUM: k = .84; PER: k = .9.</Paragraph>
      <Paragraph position="4"> GF This attribute was used to annotate the grammatical function of the NP, a property generally taken to play an important role in determining the salience of the discourse entity it realizes (Grosz et al., 1995). Our instructions for this attribute are derived from those used in the FRAMENET project ((Baker et al., 1998); see also http://www.icsi.berkeley.edu/~framenet/).</Paragraph>
      <Paragraph position="5"> The values are subj, obj, predicate (used for post-verbal objects in copular sentences, such as This is (a production watch)), there-obj (for post-verbal objects in there-sentences), comp (for indirect objects), adjunct (for the argument of PPs modifying VPs), gen (for NPs in determiner position in possessive NPs), np-compl, np-part, np-mod, adj-mod, and no-gf (for NPs occurring by themselves - eg., in titles). The agreement values for GF is k = .85.</Paragraph>
      <Paragraph position="6"> LF TYPE Not all NPs realize discourse entities: some of them realize quantifiers (e.g., each coffer in Each coffer has a lid) or predicates (e.g., NPs in appositive position, such as the oldest son of Louis XIV in The 1689 inventory of the Grand Dauphin, the oldest son of Louis XIV, lists a jewel coffer of similar form and decoration. As said above, in the GNOME annotation all NPs are treated as markables, but the LF TYPE attribute is used to indicate the type of semantic object denoted by an NP: term, quant or pred. Quantifiers were identified purely on the basis of the value of the CAT value: all NPs with CAT=q-np or q-pro should get a value of quant. A more complex test was used to identify predicative NPs: three linguistic contexts in which NP are typically predicative were considered (appositions, postcopular position in there-sentences, and become-style sentences) but the annotators were explicitly asked to check whether the NP was used to express a property. Agreement was more tentative: k = .73 (for two annotators, 200 NPs).</Paragraph>
      <Paragraph position="7"> Taxonomic information Two semantic attributes capture information about the type of objects referred to (or quantifier over) by an NP. The first attribute, ONTO, was originally introduced to distinguish between gerunds (event nominalizations such as letter-writing) and bare plurals referring to concrete objects like scholars, both of which semantically denote collective objects (Link, 1983; Portner, 1992). Further distinctions were introduced to deal with 'difficult' objects, such as diseases; particular types of concrete objects such as medicines and persons were also singled out. Distinctions captured by the current set of values of ONTO include persons, medicines, other substances, other concrete objects; events, time intervals, or other abstract entities; spatial locations; and diseases. The agreement value for the latest version of ONTO was k = .8 between two annotators, 200 NPs.</Paragraph>
      <Paragraph position="8"> The second 'taxonomic' attribute, ANI, is used to annotate whether the objects referred to or quantifier over by an NP are animate or inanimate. This annotation was motivated by a number of studies suggesting that animacy plays an important role in salience (Prat-Sala and Branigan, 2000) and our own experiments suggesting that animacy is much more important than grammatical function, thematic roles, or order of mention in determining which entities are most likely to be pronominalized (Pearson et al., 2001). We also found that the discrepancy between the results of Gordon et al. (1999) and the findings of (Walker and Prince, 1996) can be explained in terms of animacy (Poesio and Nissim, 2001). Animacy was by far the easiest semantic attribute for our annotators: k = .92.</Paragraph>
    </Section>
    <Section position="3" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
4.3 Semantic properties of Discourse Entities
</SectionTitle>
      <Paragraph position="0"> Semantic properties that may play a role in realization but only apply to discourse entities include:4 Structure Two attributes are used to indicate whether the discourse entity realized by an NP refers to a mass of certain substance or to countable objects (attribute COUNT) and, in case of countable objects, to an atom or a set (attribute STRUCTURE).</Paragraph>
      <Paragraph position="1"> These attributes were marked in order to study the 4These attributes were only marked for about 25% of the corpus.</Paragraph>
      <Paragraph position="2"> factors leading to the realization of a discourse entity as a bare NP, in combination with the annotation of genericity discussed below: the reasoning being that it should only be possible to use bare singulars to realize a discourse entity described with mass nouns (as in the ebeniste and his wife lived modestly in a five-room apartment . . . with simple furniture).5 The main reason for keeping the two attributes separate was that reaching agreement on STRUCTURE was fairly easy (k = .82 at the second attempt) whereas COUNT was one of the most difficult attributes to mark-it took several iterations of changes to the instructions to achieve a k = .78, and substantial revisions would probably still be useful.</Paragraph>
      <Paragraph position="3"> Nevertheless, given currently accepted views deriving from Link's work (1983), it would make more sense to merge the two attributes.</Paragraph>
      <Paragraph position="4"> GENERIC This attribute is used to indicate whether the NP should be interpreted generically or not, which was thought to affect at least two types of discourse entity realizations: gerunds, that we took to be event types, and bare NPs, both singular and referring to substances (e.g., ivory) and plural.</Paragraph>
      <Paragraph position="5"> Annotating this information proved to be very difficult, which was not surprising because genericity is not yet a completely understood phenomenon. One complication is that there are two types of 'generic NPs': NPs referring to kinds, such as The dodo in The dodo is extinct (being extinct is not a property that can be predicated of individual dodos), and NPs used in generic statements, such as Italians are good skiers (a property of individual Italians) (Carlson and Pelletier, 1995). Although some NPs can only be used to express one or the other interpretation (e.g., * A dodo is extinct), many can be used in both ways (Dodos are extinct).</Paragraph>
      <Paragraph position="6"> We started trying to make the very basic distinction between tokens and types one finds, e.g., in (Lyons, 1977), but even after numerous refinements we still encountered many problems. One of the problems our annotators had was whether to treat references to substances such as ivory and horn in examples like This table's marquetry of ivory and horn 'existentially,' i.e., as referring to the particular amounts of those substances used in the table, or 'generically', to refer to the kinds. In the end we decided to follow Carlson (1977) and to mark all of these examples as references to kinds, i.e., as generic. A second problem were quantifiers.</Paragraph>
      <Paragraph position="7"> Our annotators found it very hard to distinguish 5Apart from the cases in which bare singulars are used to refer to substances, such as the interiors of this pair of coffers are lined with tortoiseshell and brass, the few discussed exceptions to this rule are expressions like home in I went home.</Paragraph>
      <Paragraph position="8"> between quantified NPs used (non-generically) to quantify over a specific set of individuals at a particular spatio-temporal location, as in Many lecturers went on strike (on March 16th, 2004), and quantifiers used in generic sentences, as in Many lecturers went (habitually) on strike (during those years). The last version of the instructions (not yet added to the overall annotation manual) asked annotators to try to identify generic sentences before attempting to determine the value of the GENERIC attribute.</Paragraph>
      <Paragraph position="9"> With these instructions, we finally reached a reasonable agreement (k = .82).</Paragraph>
      <Paragraph position="10"> LOEB Poesio and Vieira (1998) found that of the 1,400 definite descriptions in their corpus, only about 50% were subsequent mention or bridging references, whereas 50% were first mentions. Of the first mentions, about half (i.e., 25% of the total) were what Hawkins (1978) would call 'larger situation' definites, i.e., definite descriptions like the pope whose referent is supposed to be part of shared knowledge; whereas the other half includes what Loebner (1987) calls SEMANTICALLY FUNC-TIONAL definites, like the first man on the Moon.</Paragraph>
      <Paragraph position="11"> Loebner claimed that the paradigmatic case of definiteness are not anaphoric NPs, as suggested by familiarity theories such as Heim's (1982), but semantically functional ones such as the first person ever to row across the Pacific on his own. In order to test Loebner's theory and compare it with one based on familiarity, we annotated the NPs referring to discourse entities according to whether they were functional, relational, or sortal (Poesio, 2004a). We achieved good reliability on this attribute (k = .82), and the results do suggest a much greater correlation between functionality and definiteness than between familiarity and definiteness (Poesio, 2004a).</Paragraph>
    </Section>
  </Section>
  <Section position="7" start_page="0" end_page="0" type="metho">
    <SectionTitle>
5 Anaphora
</SectionTitle>
    <Paragraph position="0"> The one aspect of the GNOME annotation that has been extensively discussed in previous papers is anaphoric annotation (Poesio, 2004b; Poesio et al., 2004b); we only discuss this aspect briefly here.</Paragraph>
    <Section position="1" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
5.1 Annotating Discourse Models
</SectionTitle>
      <Paragraph position="0"> Anaphoric annotation raises a number of difficult and, sometimes, unresolved semantic issues (Poesio, 2004b). As part of the MATE and GNOME projects, an extensive analysis of previously existing schemes for so-called 'coreference annotation,' such as the MUC-7 scheme, was carried out, highlighting a number of problems with such schemes, ranging from issues with the annotation methodology to semantic issues. Proposals for annotating 'coreference' such as (Hirschman, 1998) have been motivated by work on Information Extraction, hence the notion of 'coreference' used is very difficult to relate to traditional ideas about anaphora (van Deemter and Kibble, 2000). A distinctive feature of the GNOME annotation (and the MATE proposals from which they derive (Poesio, 2004b)) are explicitly based on the DISCOURSE MODEL assumption adopted almost universally by linguists (computational and not) working on anaphora resolution and generation (Webber, 1979; Heim, 1982; Kamp and Reyle, 1993; Gundel et al., 1993). This is the hypothesis that interpreting a discourse involves building a shared discourse model containing DISCOURSE ENTITIES that may or may not 'refer' to specific objects in the world, as well as the relations between these entities. The annotation for which the MATE scheme was developed-that we'll call here 'anaphoric annotation,' is meant as a partial representation of the discourse model evoked by a text.</Paragraph>
    </Section>
    <Section position="2" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
5.2 Anaphoric Annotation in GNOME
</SectionTitle>
      <Paragraph position="0"> For the GNOME corpus, we adopted a simplified version of the MATE scheme, as for our purposes it's not essential to mark all semantic relations between entities introduced by a text, but only those that may establish a 'link' between two utterances. So, for example, it was not necessary for us to mark a relation between the subject of a copular sentence and its predicate - e.g., between the price of aluminum siding and $3.85 or $4.02 in the example above.</Paragraph>
      <Paragraph position="1"> In the GNOME corpus, anaphoric information is marked by means of a special &lt;ante&gt; element; the &lt;ante&gt; element itself specifies the index of the anaphoric expression (a &lt;ne&gt; element) and the type of semantic relation (e.g., identity), whereas one or more embedded &lt;anchor&gt; elements indicate possible antecedents.6 (See (8).)  Work such as (Sidner, 1979; Strube and Hahn, 1999), as well as our own preliminary analysis, suggested that indirect realization can play a crucial role in maintaining the CB. However, previous attempts at marking anaphoric information, particularly in the context of the MUC initiative, suggested that while agreement on identity relations is 6The presence of more than one &lt;anchor&gt; element indicates that the anaphoric expression is ambiguous.</Paragraph>
      <Paragraph position="2"> fairly easy to achieve, marking bridging references is hard; this was confirmed by Poesio and Vieira (1998). For these reasons, and to reduce the annotators' work, we did not mark all relations. Besides identity (IDENT) we only marked up three associative relations (Hawkins, 1978): set membership (ELEMENT), subset (SUBSET), and 'generalized possession' (POSS), which includes part-of relations as well as ownership relations. We only marked relations between objects realized by noun phrases, excluding anaphoric references to actions, events or propositions implicitly introduced by clauses or sentences. We also gave strict instructions to our annotators limiting how much to mark.</Paragraph>
      <Paragraph position="3"> As expected, we found a reasonable (if not perfect) agreement on identity relations. In our most recent analysis (two annotators looking at the anaphoric relations between 200 NPs) we observed no real disagreements; 79.4% of the relations were marked up by both annotators; 12.8% by only one of them; and in 7.7% of the cases, one of the annotators marked up a closer antecedent than the other. With associative references, limiting the relations did limit the disagreements among annotators (only 4.8% of the relations are actually marked differently) but only 22% of bridging references were marked in the same way by both annotators; 73.17% of relations are marked by only one or the other annotator. So reaching agreement on this information involved several discussions between annotators and more than one pass over the corpus.</Paragraph>
    </Section>
  </Section>
  <Section position="8" start_page="0" end_page="0" type="metho">
    <SectionTitle>
6 Automatically computing the Local
Focus
</SectionTitle>
    <Paragraph position="0"> The reader will have noticed that no attempt was done to directly mark up properties of the local focus - e.g., which discourse entity is the CB of a particular utterance. We found that it is much easier to annotate the 'building blocks' of a theory of the local focus, and then use scripts to automatically compute the CB. There are two advantages to this approach: first of all, agreement on the 'building blocks' is much easier to reach than agreement on the CB-in our preliminary experiments we didn't go beyond k = .6 when trying to directly identify the CB using the definitions from (Brennan et al., 1987).</Paragraph>
    <Paragraph position="1"> And secondly, this approach makes it possible to compute the CB according to different ways of instantiating what we call the 'parameters of Centering' -e.g., ranking.</Paragraph>
    <Paragraph position="2"> We developed such scripts for the work discussed in (Poesio et al., 2004b); they can be tested on the web site associated with that paper, http://cswww.essex.ac.uk/staff/poesio/ cbc/. These scripts have been subsequently used to compute the CB in, e.g., (Poesio and Nissim, 2001; Poesio and Nygren-Modjeska, To appear).</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML