NAA CLoANLP 2000 Workshop 
i • Automatic Summarization 
Organizing committee 
Udo Hahn, University of Freiburg 
Chin-Yew Lin, USC Information Sciences Institute 
Inderjeet Mani, MITRE 
Dragomir Radev, University of Michigan 
April 30, 2000 
Seattle, Washington 
! 
2345 6 7 4~ 
9 10 I1 12 13 14 15 16 17 18 19 20 21 
25 26 27.28 29 ~24 22 
AUGUST 
S MTW T F S 
13 14 15 16 17 18 19 2021 22 23 24 25 25 
2728 29 30 31 
DECEMBER 
S.,.M T W T F S 
304567489 11 12 13 t5 16 
17 18 19 20 21 22 23 j 
24 25 26 27 28 29 30J 31 -j 
© 2000, Association for Computational Linguistics 
Order additional copies from: 
Association for Computational Linguistics 
75 Paterson Street, Suite 9 
New Brunswick, NJ 08901 USA 
+ 1-732-342-9100 phone 
+1-732-342-9339 fax 
ael@aclweb.org 
I 
I 
I 
I 
1 
! 
I 
I 
I 
I 
I 
I 
i 
| 
i 
i 
I 
I 
l 
i 
I 
I 
I 
i 
I 
i 
I 
PREFACE 
Automatic summarization aims at producing a concise, condensed representation of the key 
information content in an information source for a particular user and task. Interest in automatic 
summarization continues to grow, motivated by the explosion of on-line information sources and 
advances in natural  processing and information retrieval. In fact, some form of automatic 
summarization may be indispensable given the massive information universes that lie ahead in the 21 st 
century. 
The problem of automatic summarization poses a variety of tough challenges in both NL 
understanding and generation. A spate of recent papers and tutorials on this subject at conferences such 
as ACL/EACL, AAAI, ECAI, IJCAI, and SIGIR point to a growing interest in research in this field. 
Several commercial summarization products have,also' appeared. There have been several workshops 
in the past on this subject: in Dagstuhl (1994), Madrid (1997), and Stanford (1998). It is our great 
pleasure to bring the fourth such event to you. 
While the field continues to progress, there are also many problems that need to be addressed before 
the promises of automatic text summarization can be fully realized. The papers included in this volume 
present a snapshot of recent progress in solving some of the problems: concept identification, 
identification of discourse markers, multi-document summarization, content visualization, evaluation, 
and use. Out of 29 submissions to the workshop, 10 are included in these proceedings. 
We would like to thank all authors for their hard work. This workshop would not be possible without 
them. We would also like to thank the program committee and all the reviewers for their valuable 
feedback. In addition, we are grateful to the ACL for sponsoring the workshop and to the MITRE 
Corporation for supporting the publication of the proceedings. 
Udo Hahn 
Chin-Yew Lin 
Inderjeet Mani 
Dragomir Radev 
Elisabeth Andr6 
Branimir Boguraev 
Chris Buckley 
Michael Elhadad 
Takahiro Fukushima 
Udo Hahn 
Eduard Hovy 
Hongyan Jing 
Elizabeth Liddy 
Chin-Yew Lin 
Inderjeet Mani 
Daniel Marcu 
Shigeru Masuyama 
Mark Maybury 
Vibhu Mittal 
Sung Hyon Myaeng 
Akitoshi Okumura 
Dragomir Radev 
Chris Paice 
Karen Sparck-J0nes 
Tomek Strzalkowski 
Simone Teufel 
Benjamin Tsou 
Ulf Hermjakob 
Hsin-Hsi Chen 
ORGANIZING COMMITTEE 
University of Freiburg 
USC/Information Sciences Institute 
MITRE 
University of Michigan 
PROGRAM COMMITTEE 
DFK/GmbH 
IBM Research 
SablR Research 
Ben Gurion University 
Telecommunications Advancement Organization of Japan 
University of Freiburg 
USCflnformation Sciences Institute 
Columbia University 
Syracuse University 
USCflnformation Sciences Institute 
MITRE 
USC/information Sciences Institute 
Toyohashi University of Technology 
MITRE 
Just Research 
Chungnam National University 
NEC 
University of Michigan 
Lancaster University 
University of Cambridge 
GE CRD 
University of Edinburgh 
City University ofHong Kong 
ADDITIONAL REVIEWERS 
USC/Information Sciences Institute 
National Taiwan University, Taiwan 
! 
I 
I 
I 
I 
| 
i 
I 
i 
'i 
I 
i 
I 
i 
i 
I 
i 
I 
TABLE OF CONTENTS 
Concept Identification and Presentation in the Context of Technical Text Summarization 
Horacio Saggion and Guy Lapalme ............................................................................................................ 1 
Mining Discourse Markers for Chinese Textual Summarization 
Samuel W. K. Chan, Tom B, Y. Lai, W. J. Gao, and Benjamin K. T'sou ................................................ 11 
Centroid-based summarization of multiple documents:sentence extraction, utility-based evaluation, and user 
studies 
Drago .mir R. Radev, Hongyan Jing, Malgorzata Budzikowska ................................................................ 21 
Extracting Key Paragraph based on Topic and Event Detection --- Towards Multi-Document Summarization 
Fumiyo Fukumoto and Yoshimi Suzuki ................................................................................................... 31 
Multi-Document Summarization By Sentence Extraction 
Jade Goldstein, Vibhu Mittal, Jaime Carbonell, and Mark Kantrowitz .................................................... 40 
Text Summarizer in Use: Lessons Learned from Real World Deployment and Evaluation 
Mary Ellen Okurowski, Harold Wilson, Joacquin Urbina, Tony Taylor, Ruth Colvin Clark, and Frank 
Krapcho .................................................................................................................................................... 49 
Evaluation of Phrase-representation Summarization based on Information Retrieval Task 
Mamiko Oka and Yoshihiro Ueda ............................................................................................................ 59 
A Comparison of Ranla'ngs Produced by Summarization Evaluation Measures 
Robert L.Donaway, Kevin W. Drummey, and Laura A. Mather ............................................................. 69 
Multi-document Summarization by Visualizing Topical Content 
Rie Kubota Ando, Branimir K. Boguraev, Roy J. Byrd, and Mary S. Neff ............................................. 79 
Using Summarization for Automatic Briefing Generation 
Inderjeet Mani, Kristian Concepcion, and Linda van Guilder 89 
Branimir Boguraev ..................................... 79 
Roy Byrd ..................................................... .79 
Malgorzata Budzikowska ........................... 21 
Jaime Carbonell .......................................... 40 
Samuel W. K. Chan ................................... 11 
Ruth Colvin Clark ....................................... 49 
Kristian Concepcion ................................... 89 
Robert L. Donaway ..................................... 69 
Kevin W. Drummey .................................... 69 
Fumiyo Fukumoto ...................................... 31 
W. J. Gao .. .................................................. 11 
Jade Goldstein ............................................. 40 
Hongyan Jing ................................. . ............ 21 
• Mark Kantrowitz ......................................... 40 
Frank Krapcho ............................................ 49 
Rie Kubota Ando ........................................ 79 
Tom B. Y. Lai ............................................. 11 
AUTHORINDEX 
Guy Lapalme .................................................. 1 
Inderjeet Mani ............................................. 89 
Laura A. Mather .......................................... 69 
Vibhu Mittal ................................................ 40 
Mary Neff. ................................................... 79 
Mamiko Oka ............................................... 59 
Mary Ellen Okurowski ................................ 49 
Dragomir R. Radev ............................ . ........ 21 
Horacio Saggion ............................................ 1 
Yoshimi Suzuki ........................................... 31 
Tony Taylor ........... . .................................... 49 
Benjamin K. T'sou ...................................... 11 
Yoshihiro Ueda... ........................................ 59 
Joacquin Urbina .......................................... 49 
Linda Van Guilder ...................................... 89 
Harold Wilson ............................................. 49 
