File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/04/c04-1057_abstr.xml
Size: 975 bytes
Last Modified: 2025-10-06 13:43:19
<?xml version="1.0" standalone="yes"?> <Paper uid="C04-1057"> <Title>A Formal Model for Information Selection in Multi-Sentence Text Extraction</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Selecting important information while accounting for repetitions is a hard task for both summarization and question answering. We propose a formal model that represents a collection of documents in a two-dimensional space of textual and conceptual units with an associated mapping between these two dimensions.</Paragraph> <Paragraph position="1"> This representation is then used to describe the task of selecting textual units for a summary or answer as a formal optimization task. We provide approximation algorithms and empirically validate the performance of the proposed model when used with two very different sets of features, words and atomic events.</Paragraph> </Section> class="xml-element"></Paper>