File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/h05-1031_abstr.xml
Size: 1,378 bytes
Last Modified: 2025-10-06 13:44:12
<?xml version="1.0" standalone="yes"?> <Paper uid="H05-1031"> <Title>Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), pages 241-248, Vancouver, October 2005. c(c)2005 Association for Computational Linguistics Automatically Learning Cognitive Status for Multi-Document Summarization of Newswire</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Machine summaries can be improved by using knowledge about the cognitive status of news article referents. In this paper, we present an approach to automatically acquiring distinctions in cognitive status using machine learning over the forms of referring expressions appearing in the input. We focus on modeling references to people, both because news often revolve around people and because existing natural language tools for named entity identi cation are reliable. We examine two speci c distinctions whether a person in the news can be assumed to be known to a target audience (hearer-old vs hearer-new) and whether a person is a major character in the news story. We report on machine learning experiments that show that these distinctions can be learned with high accuracy, and validate our approach using human subjects.</Paragraph> </Section> class="xml-element"></Paper>