<?xml version="1.0" standalone="yes"?>
<Paper uid="W00-0401">
  <Title>Concept Identification and Presentation in the Context of Technical Text Summarization</Title>
  <Section position="3" start_page="0" end_page="0" type="intro">
    <SectionTitle>
2 Text Summarization
</SectionTitle>
    <Paragraph position="0"> The process of producing a summary from a source text consists of the following steps: (i) the interpretation of the text; (ii) the extraction of the relevant information which ideally includes the &quot;topics&quot; of the source; (iii) the condensation of the extracted information and construction of a summary representation; and (iv) the presentation of the summary representation to the reader in natural language.</Paragraph>
    <Paragraph position="1"> While some techniques exist for producing summaries of domain-independent texts (Luhn, 1958; Marcu, 1997), it seems that domain-specific texts require domain-specific techniques (DeJong, 1982; Paice and Jones, 1993). In our case, we are dealing with technical articles, which are the result of the complex process of scientific inquiry that starts with the identification of a knowledge problem and eventually culminates in the discovery of an answer to it. Even if authors of technical articles write about several concepts, not all of those concepts are topics. To address the issues of topic identification, content selection, and presentation, we have studied manually produced alignments of sentences from professional abstracts with sentences from</Paragraph>
  </Section>
</Paper>