File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/00/j00-2001_abstr.xml
Size: 4,235 bytes
Last Modified: 2025-10-06 13:41:41
<?xml version="1.0" standalone="yes"?> <Paper uid="J00-2001"> <Title>Integrating Text Planning and Linguistic Choice Without Abandoning Modularity: The IGEN Generator</Title> <Section position="2" start_page="0" end_page="0" type="abstr"> <SectionTitle> 1. Introduction </SectionTitle> <Paragraph position="0"> Programs that generate natural language have generally been divided into two largely independent components: a text planning component that organizes the information to be expressed, and a linguistic component that converts the information into grammatical sentences of some natural language (e.g., Thompson 1977; McKeown 1985; McDonald 1983; Meteer 1989; Reithinger 1990; Dale 1989). This division seems natural, and has indeed proven useful, because the two components deal with different kinds of data and involve different kinds of reasoning. The planning component reasons about information and text structures to plan a coherent discourse, while the linguistic component arranges the lexical elements and syntactic constructions available in some language to ensure the resulting text's grammaticality and comprehensibility.</Paragraph> <Paragraph position="1"> This division into independent components, though, is only tenable if the decisions each component must make can be made independently. If some of the generator's work involves both text planning and purely linguistic issues, there will be no way to make the necessary decisions without violating the generator's modularity. As we shall see, natural language generation does indeed involve such decisions. The need to handle interactions between text planning and linguistic concerns thus seems to require either abandoning the division into separate components (e.g., Danlos 1987; Kantrowitz and Bates 1992), with the resulting increase in complexity, or foregoing the ability to handle these interactions at all (as advocated in Reiter \[1994\]). A third option is to identify possible interactions in advance and handle them as special cases, but this only allows the generator to handle those interactions that have been antici* AnswerLogic, Inc., 1111 19 St. NW, Suite 600, Washington, DC 20036. E-mail: rrubinoff@answerlogic. com. The research described here was done at the University of Pennsylvania and Carnegie Mellon University.</Paragraph> <Paragraph position="2"> (~) 2000 Association for Computational Linguistics Computational Linguistics Volume 26, Number 2 pated in advance. Furthermore, handling the interactions often requires violating the modularity of the system, because detecting when the special case has occurred still requires both planning and linguistic knowledge.</Paragraph> <Paragraph position="3"> The IGEN generator solves this problem. IGEN handles interactions between the text planning and linguistic components without having to sacrifice any of the generator's modularity. The key to IGEN's approach is the use of annotations that the linguistic component attaches to each linguistic expression it constructs. These annotations abstract away from the details of the linguistic expressions, describing only those properties of the expressions that are potentially relevant to the planner. The planner can then evaluate the choices made by the linguistic component and determine how those choices interact with the text plan independently of the linguistic component's processes and data structures. As a result, IGEN can make decisions involving interactions between the components while retaining complete modularity. In fact, replacing IGEN's normal linguistic component with one for a different language involves no change in the planner despite the fact that the two languages have d~ferent idiomatic expressions for some of the sentences being generated; the planner's processing is identical in both languages. Furthermore, since the annotations allow IGEN to explicitly model the effects of its decisions, IGEN is able to gracefully handle the effects of limitations on processing time or linguistic resources; since IGEN always knows how well a given linguistic construction carries out its plan, it can choose the best available construction when it can't find an ideal one.</Paragraph> </Section> class="xml-element"></Paper>