<?xml version="1.0" standalone="yes"?>
<Paper uid="W02-1411">
  <Title>Acquisition of Lexical Paraphrases from Texts</Title>
  <Section position="3" start_page="2" end_page="2" type="intro">
    <SectionTitle>
3 Approach and Implementation
</SectionTitle>
    <Paragraph position="0"> This section describes our approach to acquiring paraphrase knowledge from a text corpus. We use the Perl programming language to implement all of the following processes and experiments.</Paragraph>
    <Section position="1" start_page="2" end_page="2" type="sub_section">
      <SectionTitle>
3.1 Collection of context from corpus
</SectionTitle>
      <Paragraph position="0"> We first define the term "context" as used in this paper.</Paragraph>
      <Paragraph position="1"> The context of a content word is defined as the set of direct dependency relations between that word and the words that surround it in the text.</Paragraph>
      <Paragraph position="2"> That is, the context of a content word c is, in our sense, defined as the collection of words upon which c directly depends as well as the collection of words that directly depend on c.</Paragraph>
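The definition of context above can be illustrated with a minimal sketch. It is written in Python for brevity, whereas the paper's own implementation is in Perl, and it is not the authors' code. The function name collect_contexts, the (dependent, head) pair format, and the toy word pairs are all assumptions made for illustration; the sketch also does not filter for content words.

```python
from collections import defaultdict


def collect_contexts(dependencies):
    """Build the context of every word from direct dependency relations.

    `dependencies` is an iterable of (dependent, head) pairs. This input
    format is hypothetical; it stands in for the output of whatever
    dependency analysis is applied to the corpus.
    """
    contexts = defaultdict(lambda: {"heads": set(), "dependents": set()})
    for dependent, head in dependencies:
        # Words upon which `dependent` directly depends.
        contexts[dependent]["heads"].add(head)
        # Words that directly depend on `head`.
        contexts[head]["dependents"].add(dependent)
    return dict(contexts)


# Toy relations invented for illustration, not taken from the paper's corpus.
toy_dependencies = [
    ("government", "announced"),
    ("plan", "announced"),
    ("new", "plan"),
]
contexts = collect_contexts(toy_dependencies)
```

In this toy input, the context of "plan" consists of the word it depends on ("announced") together with the word that depends on it ("new"), which mirrors the two collections named in the definition.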
      <Paragraph position="3"> Under this definition, we first collected all of the dependency relations observed in the corpus. Each article is segmented and part-of-speech tagged by the morphological analyzer</Paragraph>
    </Section>
  </Section>
</Paper>