<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-0801">
  <Title>The Basque lexical-sample task</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> This paper reviews the Basque lexical-sample task organized for Senseval 3. Each participant was provided with a relatively small set of labelled examples (2/3 of 75+15*senses+7*multiwords) and a comparatively large set of unlabelled examples (roughly ten times more when possible) for around 40 words. The larger amount of unlabelled data was released to enable the exploration of semi-supervised systems.</Paragraph>
    <Paragraph position="1"> The test set comprised 1/3 of the tagged examples.</Paragraph>
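    As a rough illustration of the sizing rule above, the following Python sketch computes the number of tagged examples for a single word and its train/test split. The function names are hypothetical, and three points are assumptions not stated in the text: that "senses" and "multiwords" are per-word counts, that the test third is rounded down, and that the unlabelled target is ten times the labelled training set.

```python
def tagged_examples(senses: int, multiwords: int) -> int:
    """Tagged examples for one word: 75 + 15*senses + 7*multiwords."""
    return 75 + 15 * senses + 7 * multiwords


def split_sizes(senses: int, multiwords: int) -> tuple:
    """Return (train, test): 2/3 labelled for training, 1/3 for testing.

    Assumption: the test third is rounded down and the remainder
    goes to the training set.
    """
    total = tagged_examples(senses, multiwords)
    test = total // 3
    train = total - test
    return train, test


def unlabelled_target(senses: int, multiwords: int) -> int:
    """Assumption: 'roughly ten times more' refers to the training set."""
    train, _ = split_sizes(senses, multiwords)
    return 10 * train


if __name__ == "__main__":
    # Hypothetical word with 4 senses and 2 multiword entries.
    print(tagged_examples(4, 2))    # 149
    print(split_sizes(4, 2))        # (100, 49)
    print(unlabelled_target(4, 2))  # 1000
```

    For the hypothetical word above, the rule gives 149 tagged examples, of which 100 would be released for training and 49 held out for testing.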
    <Paragraph position="2"> The sense inventory was taken from the Basque WordNet, which is linked to WordNet version 1.6 (Fellbaum, 1998). The examples came mainly from newspaper texts, although we also used a balanced in-house corpus and texts from the Internet. The words selected for this task were coordinated with other lexical-sample tasks (such as Catalan, English, Italian, Romanian and Spanish) in order to share around 10 of the target words.</Paragraph>
    <Paragraph position="3"> (*) Authors listed in alphabetical order. The following steps were taken to carry out the task:</Paragraph>
    <Paragraph position="4">
1. set the exercise
   a. choose sense inventory from a pre-existing resource
   b. choose target corpora
   c. choose target words
   d. lemmatize the corpus automatically
   e. select examples from the corpus
2. hand-tagging
   a. define the procedure
   b. revise the sense inventory
   c. tag
   d. analyze the inter-tagger agreement
   e. arbitrate

This paper is organized as follows: The following section presents the setting of the exercise. Section 3 reviews the hand-tagging, and Section 4 the details of the final release. Section 5 shows the results of the participant systems. Section 6 discusses some of the main issues and, finally, Section 7 draws the conclusions.
</Paragraph>
  </Section>
</Paper>