File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/04/w04-3247_intro.xml

Size: 1,499 bytes

Last Modified: 2025-10-06 14:02:53

<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-3247">
  <Title>LexPageRank: Prestige in Multi-Document Text Summarization</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> Text summarization is the process of automatically creating a compressed version of a given text that provides useful information for the user. In this paper, we focus on multi-document generic text summarization, where the goal is to produce a summary of multiple documents about the same, but unspecified topic.</Paragraph>
    <Paragraph position="1"> Our summarization approach is to assess the centrality of each sentence in a cluster and include the most important ones in the summary. In Section 2, we present centroid-based summarization, a well-known method for judging sentence centrality. Then we introduce two new measures for centrality, Degree and LexPageRank, inspired from the &amp;quot;prestige&amp;quot; concept in social networks and based on our new approach. We compare our new methods and centroid-based summarization using a feature-based generic summarization toolkit, MEAD, and show that new features outperform Centroid in most of the cases.</Paragraph>
    <Paragraph position="2"> Test data for our experiments is taken from Document Understanding Conferences (DUC) 2004 summarization evaluation to compare our system also with other state-of-the-art summarization systems.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML