File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/04/w04-0808_intro.xml

Size: 1,615 bytes

Last Modified: 2025-10-06 14:02:35

<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-0808">
  <Title>An Evaluation Exercise for Romanian Word Sense Disambiguation</Title>
  <Section position="3" start_page="0" end_page="0" type="intro">
    <SectionTitle>
2 Open Mind Word Expert
</SectionTitle>
    <Paragraph position="0"> The sense annotated corpus required for this task was built using the Open Mind Word Expert system (Chklovski and Mihalcea, 2002), adapted to Romanian1. null To overcome the current lack of sense tagged data and the limitations imposed by the creation of such data using trained lexicographers, the Open Mind Word Expert system enables the collection of semantically annotated corpora over the Web.</Paragraph>
    <Paragraph position="1"> Sense tagged examples are collected using a Web-based application that allows contributors to annotate words with their meanings.</Paragraph>
    <Paragraph position="2"> The tagging exercise proceeds as follows. For each target word the system extracts a set of sentences from a large textual corpus. These examples are presented to the contributors, who are asked to select the most appropriate sense for the target word in each sentence. The selection is made using checkboxes, which list all possible senses of the current target word, plus two additional choices, &amp;quot;unclear&amp;quot; and &amp;quot;none of the above.&amp;quot; Although users are encouraged to select only one meaning per word, the selection of two or more senses is also possible. The results of the classification submitted by other users are not presented to avoid artificial biases.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML