File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/00/p00-1010_abstr.xml
Size: 2,409 bytes
Last Modified: 2025-10-06 13:41:42
<?xml version="1.0" standalone="yes"?> <Paper uid="P00-1010"> <Title>Robust Temporal Processing of News</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We introduce an annotation scheme for temporal expressions, and describe a method for resolving temporal expressions in print and broadcast news. The system, which is based on both hand-crafted and machine-learnt rules, achieves an 83.2% accuracy (Fmeasure) against hand-annotated data. Some initial steps towards tagging event chronologies are also described.</Paragraph> <Paragraph position="1"> Introduction The extraction of temporal information from news offers many interesting linguistic challenges in the coverage and representation of temporal expressions. It is also of considerable practical importance in a variety of current applications. For example, in question-answering, it is useful to be able to resolve the underlined reference in &quot;the next year, he won the Open&quot; in response to a question like &quot;When did X win the U.S. Open?&quot;. In multi-document summarization, providing fine-grained chronologies of events over time (e.g., for a biography of a person, or a history of a crisis) can be very useful. In information retrieval, being able to index broadcast news stories by event times allows for powerful multimedia browsing capabilities.</Paragraph> <Paragraph position="2"> Our focus here, in contrast to previous work such as (MUC 1998), is on resolving time expressions, especially indexical expressions like &quot;now&quot;, &quot;today&quot;, &quot;tomorrow&quot;, &quot;next Tuesday&quot;, &quot;two weeks ago&quot;, &quot;20 mins after the next hour&quot;, etc., which designate times that are dependent on the speaker and some &quot;reference&quot; time1. In this paper, we discuss a temporal annotation scheme for representing dates and times in temporal expressions. This is followed by details and performance measures for a tagger to extract this information from news sources. The tagger uses a variety of hand-crafted and machine-discovered rules, all of which rely on lexical features that are easily recognized. We also report on a preliminary effort towards constructing event chronologies from this data.</Paragraph> </Section> class="xml-element"></Paper>