File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/w06-0608_abstr.xml

Size: 972 bytes

Last Modified: 2025-10-06 13:45:18

<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-0608">
  <Title>The Hinoki Sensebank -- A Large-Scale Word Sense Tagged Corpus of Japanese --</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> Semantic information is important for precise word sense disambiguation systems and for the kind of semantic analysis used in sophisticated natural language processing applications such as machine translation and question answering. There are at least two kinds of semantic information: lexical semantics for words and phrases, and structural semantics for phrases and sentences.</Paragraph>
    <Paragraph position="1"> We have built a Japanese corpus of over three million words with both lexical and structural semantic information. In this paper, we focus on our method of annotating the lexical semantics, that is, on building a word sense tagged corpus, and on the properties of that corpus.</Paragraph>
  </Section>
</Paper>