File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/97/w97-0619_metho.xml
Size: 20,316 bytes
Last Modified: 2025-10-06 14:14:43
<?xml version="1.0" standalone="yes"?> <Paper uid="W97-0619"> <Title>Corpus-Based Information Presentation for a Spoken Public Transport Information System* M.M.M. Rats and R.J. van Vark and J.P.M. de Vreught Knowledge Based Systems</Title> <Section position="4" start_page="106" end_page="107" type="metho"> <SectionTitle> 3 Information presentation in VIos </SectionTitle> <Paragraph position="0"> In VIos, the travel plan is presented using templates filled with specific stations and times. The station names and times have a disambiguated form, always resulting in a full and uniquely identifying description. The templates and the words for stations and times are prerecorded and their accoustic representations are concatenated to form complete sentences. Together these sentences form a monologue. No interruptions by the caller are possible (see figure 1). The architecture of VIOS is such that the dialogue management component and the text generating component work in sequential order. When the travel scheme has been determined, the dialogue manager sends the entire scheme to the text generator. This component communicates the plan to the caller. During the presentation process, the system stops listening. So, no feedback from the user to the system is possible.</Paragraph> <Paragraph position="1"> Appreciation studies have shown that the VIos presentation of information is far from ideal for the following reasons (Blasband, 1997; Manintveld and Rothkrantz, 1997; Peters and Kock, 1997): All information is given at once. It would be more convenient for the caller if the system would provide the information in smaller chunks.</Paragraph> <Paragraph position="2"> * The caller is not able to interrupt when he does not comprehend the given information.</Paragraph> <Paragraph position="3"> The use of full names for stations is very unnatural and confusing, especially when the caller has used other descriptions to introduce them.</Paragraph> <Paragraph position="4"> The system makes the caller feel hunted in processing and copying down the information since it speaks too fast.</Paragraph> <Paragraph position="5"> The information is spoken in an unnatural way.</Paragraph> <Paragraph position="6"> The concatenated phrases exhibit differences in speech tempo and loudness. Since they miss a natural intonation pattern, they are awkward to understand.</Paragraph> <Paragraph position="7"> In this paper, we will only work towards a solution for the first two problems. The next three items will be left for further research. We will take the human-human dialogues as an example. This choice is dictated by our comprehensive appreciation study, where 500 respondents were asked to call different Wizard of Oz simulation systems, VIOS, and a C: ik wilde graag weten als ik morgen ore: morgenochtend om negen uur \[u:h\] op de Lelylaan in Amsterdam moet zijn hoe laat moet ik vanuit Hilversum vertrekken dan? h dat wordt\[u:h\] acht uur negentien vertrekken uit Hilversum C: acht uur ja I: en dan overstappen Amsterdam C: \[u:h\] ja h vertrek Amsterdam acht vijfenvijftig C: acht vijfenvijftig h komt u negen nul drie aan Amsterdam Lelylaan C: ok6 dus acht uur negentlen vertrekken hartstikke bedankt C: I would like to know if I have to be tomorrow at: tomorrow morning at nine o'clock \[u:h\] at the Lelylaan in Amsterdam ~t what time do I have to leave from Hilversum? h that will be\[u:h\] depart from Hilversum at eight hours nineteen C: eight hours yes h and then change Amsterdam C: \[u:h\] yes I: departure Amsterdam eight fifty-five C: eight fifty-five h will arrive at Amsterdam Lelylaan at nine o three C: okay so depart at eight hours nineteen okay thank you very much human operator. One of its most important conclusions was that callers appreciate the human operator over all kinds of automated systems.</Paragraph> </Section> <Section position="5" start_page="107" end_page="111" type="metho"> <SectionTitle> 4 Information presentation in </SectionTitle> <Paragraph position="0"> human-human OVR dialogues A study of a sample of 100 information dialogues out of a corpus of over 5000 dialogues shows that the presentation of a travel plan in a human-human dialogue involves more than just a monologue that presents the entire plan at once. The information presentation has a more interactive form. A human information service presents the travel plan in a step-wise way, generally giving at least one piece of new information with each turn. This presentation follows the temporal order of the different stages in the travel plan. After each step in the information presentation, the caller shows that he has processed the step by an acknowledgement. The dialogue fragment displayed in figure 2 shows an example of an information presentation in an OVR dialogue. We see that each step in the plan is acknowledged, before the next one is given.</Paragraph> <Paragraph position="1"> This stepwise presentation and acceptance of the travel plan is one of the most important characteristics of the information phase of a naturally occurring OVR dialogue. In this way, the operator is able to communicate the information as clear as possible and the caller can relate the new information to information already known.</Paragraph> <Paragraph position="2"> Table 1 shows the quantitative facts that underly our description. It shows the amount of utterances the information service applies in one turn. We see that 87% of the turns contain only one utterance and 10% contain two utterances. The maximum of utterances per turn is 5, which is in only 0.4% of the cases. Only in this case with five utterances, a whole travel plan is given at once. This confirms our view that in human-human OVR dialogues, the travel plan is given in steps.</Paragraph> <Paragraph position="3"> A closer look at the exact information transfer in an OVR dialogue reveals even more about the exact information structure of the individual utterances of the information service. The information transfer in an OVR dialogue consists of three phases: 1. a query phase, 2. a search phase, and 3. an information phase.</Paragraph> <Paragraph position="4"> During the query phase the caller poses his query, and the information service tries to understand this query as clear as possible. In the search phase (often indicated by a silence since the operator is searching), the information service applies the database query, and chooses the right travel plan. In the information phase, the information service communicates the travel plan to the caller and the caller tries to get the plan as straight as possible.</Paragraph> <Paragraph position="5"> In each of these phases, different information elements play a crucial role. During the query phase, these elements are the departure place, the arrival place, a global indication of the departure or arrival time, the day of travel, and if the caller wants a direct connection. The information service needs these information elements to compose an appropriate database query and to choose the most suitable travel plan. For the presentation of the travel plan, other information elements become important: the departure time, the arrival time, the places where to change, the directions of trains, the departure and arrival times at the places of change. The caller needs to know these information elements to carry out this plan.</Paragraph> <Paragraph position="6"> In the information phase, the information service uses these information elements to compose her presentation. In general, she will use the elements that are already known from the query phase or from previous utterances within the information phase, as a point of attachment for presenting the unknown elements. The order of the steps in the travel plan are the guiding principle behind the order in which the elements are presented. The dialogue fragment in figure 2 illustrates this. In the query phase of this dialogue, the speakers have established the day, a global arrival time, the arrival place, and the departure place. The information service has found a proper travel plan and starts her presentation.</Paragraph> <Paragraph position="7"> In consecutive turns, she gives the departure time (new) at the departure place (given), then the place where to change (new), then the departure time (new) at the place of change (given in the previous utterance), then the arrival time (new) at the arrival place (given).</Paragraph> <Paragraph position="8"> Tables 2, 3, and 4 show the quantitative facts that underly our description. Table 2 shows the amount of information elements for each utterance of the information service. We see that 69% of the utterances contain 2 information elements, that 18% contains only one element, and 12% contains 3 elements. This confirms our view that mainly two information elements per utterance are given.</Paragraph> <Paragraph position="9"> with one, two, three, and four information elements respectively. We see that utterances with one element contain in most of the cases (68%) a new element. In the other cases (32%), they either refer to a crucial step in the information exchange, or give a alternative description of a previously introduced element. We see that most utterances with more than one information element contain at least one new element. The given-new distinctions in the table reflect the order in which the elements occur. One can observe that in utterances with two elements, the speakers prefer to mention the new element first, * while in case of three elements, speakers prefer to mention a given element first.</Paragraph> <Paragraph position="10"> one, two, three, and four information elements respectively.</Paragraph> <Paragraph position="11"> The table confirms our view that speakers tend to present at least one piece of new information per utterance. Usually, they will relate this new information with an entity introduced in the preceding context. This communicative rule was found in other corpora of spoken discourse as well (Chafe, 1987; Rats, 1996).</Paragraph> <Paragraph position="12"> Table 4 shows which specific information elements generally serve as given and which serve as new information. We see that most departure and arrival times (97% in both cases) are new for the caller, while all departure and arrival places (100% in both cases) are known from the previous context. For the places of change and the directions, the result is roughly fifty-fifty.</Paragraph> <Paragraph position="13"> The table confirms our observation that departure and arrival places are generally introduced during the query phase and serve as given information during the information phase. On the contrary, places of change and the directions of train are mostly introduced in the information phase, and become given information after introduction. The departure and arrival times commonly serve as new information.</Paragraph> <Paragraph position="14"> After the presentation and acceptance of a whole travel plan, a caller may ask for new travel plans. This happens in 26% of the dialogues. Most of these plans (84%) concern earlier or later connections. In 16% of the cases, the callers ask for another travel plan, for instance, for a connection from the station where the previous trip ended or another connection from the same departure place.</Paragraph> <Paragraph position="15"> 5 Repair in the presentation phase Figure 2 shows an information phase where the caller has no problems in processing the presentation of the travel plan. Nevertheless, during communication problems may arise. In such cases, the caller will interrupt the presentation by starting a repair sequence to solve the problem: * The caller will start a reconfirmation sequence, if he is not sure that he has heard the operator's utterances well and he wants the information service to repeat, to complete, or to.confirm previously given information. A reconfirmation sequence consists of a reconfirm by the caller and an appropriate answer by the information service.</Paragraph> <Paragraph position="16"> * The caller will apply a clarification sequence, if he wants extra information about the current plan. A clarification sequence consists of a wh-question of the caller and an appropriate answer by the information service.</Paragraph> <Paragraph position="17"> * The caller will apply a checking sequence, if he wants to check extra information about the current plan that he suspects to be true. A checking sequence consists of a check by the caller and an appropriate answer by the information service.</Paragraph> <Paragraph position="18"> * The caller will start a correcting sequence, if he notices that the information service gives inappropriate information. A correcting sequence consists of a correction and possibly a negative acknowledgement by the caller and an appropriate answer by the information service.</Paragraph> <Paragraph position="19"> In case the caller does not notice problems himself, the information service may infer from the caller's responses that the caller did not process her utterances as intended. She can infer this from his acknowledgements, but also from his reconfirmations, checks, and wh-questions. In such cases, she will interrupt the information presentation by a correcting sequence.</Paragraph> <Paragraph position="20"> Table 5 shows the frequency of the caller's repair sequences compalred with the presence of positive acknowledgements. We see that repair sequences do not occur as frequently as positive acknowledgements. Most of the caller's reactions (63%) are positive acknowledgements. Considerably less of the caller's reactions are wh-questions (4%), checks (10%), or reconfirmations (17%), and very few reactions are corrections. In 6% of the cases, the reactions concern questions for other or related travel plans.</Paragraph> <Paragraph position="21"> Repair sequences appear at different places in the information exchange. They may appear directly after the utterance to which they react. However, they may also occur after the acknowledgement of the utterance to which they react, and at the end of the complete presentation and acceptance of the travel plan. Table 6 shows the frequencies of these three possibilities for each repair act.</Paragraph> <Paragraph position="22"> We see that wh-questions, checks, and questions for an extra travel plan mainly occur after the complete presentation of the travel plan. By contrast, reconfirmations and corrections mainly occur directly after the problematic utterance. This difference is of course understandable. Reconfirmations and corrections directly concern problems in processing the previous utterance, while wh-questions and checks mainly ask or check extra information about the travel plan. It is more polite to keep these last kind of questions until the information service is ready with her presentation.</Paragraph> <Paragraph position="23"> Nevertheless, the table shows that speakers may violate these habits, since they may utter reconfirmations after the whole presentation (10%) --although it seemed that they had understood and accepted it-- and wh-questions and checks directly after an informing utterance or the acknowledgement of that utterance.</Paragraph> <Paragraph position="24"> 6 Towards a new strategy of information presentation Analysis of information presentation in VIOS and OVR dialogues shows an important difference in strategy. The VIos system presents complete travel plans as a whole, while human operators give the information in several chunks. Future releases of VIOS will have to follow this latter strategy as much as possible, because it highly influences the appreciation of clients.</Paragraph> <Paragraph position="25"> As a result, the travel plan will have to be divided in manageable chunks of information which follow the temporal order of the travel schedule. Each piece of information corresponds to a turn in the dialogue. Such a turn will introduce exactly one new information element as happens in most of the OVR dialogues. This new information element will often be accompanied by an already given element.</Paragraph> <Paragraph position="26"> We have taken a first step in this direction, by extracting presentation scenarios for different dialogues situations from our sample corpus. The choice for a certain scenario will depend on two types of information: 1. the information elements that the system has gathered during the query phase, and 2. the information that the system has received from the database query.</Paragraph> <Paragraph position="27"> The information acquired from the database will influence the choice for a certain scenario most, since a travel scheme with two changes will result in another presentation than a direct connection. Nevertheless, the scenario must also contain the right given-new combinations for the individual utterances. The information elements that the system has gathered during the query phase will have to serve as the given information, while the new information that the system has received from the database query will have to function as the new information.</Paragraph> <Paragraph position="28"> Table 7 gives an example of such a scenario. It shows which scenario should be used, given certain information elements gathered during the query phase and the information elements brought up by the database query. Each line in this scenario refers to a separate chunk of information. It also shows the order in which the elements should be uttered. The table also gives a possible linguistic form of the separate lines in the scenario. After each line, an acknowledgement or a short repair sequence may follow. null The dialogue manager will incorporate the lines into separate statements and will send them one by one the text generator, awaiting the user's reaction before to decide to go on. The dialogue manager will proceed with the next chunk if the user has acknowledged the presented information. In this case, a relatively long period of silence can also be taken as a positive acknowledgement. If the user reacts by a wh-question, a check, or a reconfirmation, the appropriate response will be given before it will continue the presentation. If a misinterpretation is detected the system will first start a correction sequence. When the complete presentation is finished and thus acknowledged by the caller, he may either finish the conversation or pose a new query. In the last case, the whole dialogue management process will be started again: the representation of the query will be updated, a new database query will be posed, and an appropriate scenario will be chosen. The same will happen when the user corrects the system because it does not give the plan he wants.</Paragraph> <Paragraph position="29"> To improve information presentation further, dialogue management and text generation will have to collaborate intensively. The text generator will have to choose the right linguistic form, following Information given by the query phase: Information given by the database query: Scenario: Possible linguistic form: Departure place, arrival place, and a global indication of the departure or arrival time One train connection with one change, exact departure times, exact arrival times, place of change which is the same as the direction of the train, the direction of the second train is the same as the arrival place en dan bent u daar om \[ART\] ( I have a train at \[DeT\] from \[DeP\].</Paragraph> <Paragraph position="30"> It will arrive at \[ART\] in \[PoC\].</Paragraph> <Paragraph position="31"> There will leave at \[DeT\] the train to \[ArP\] and then you will be there at \[ART\].) the prescribed dialogue act and given-new division. The corpus work described in Rats and Bunt (Rats and Bunt, 1997) on information packaging in Dutch information dialogues may be used for this. The dialogue manager will probably have to incorporate extra contextual information into its instructions, in case several repair sequences will appear between two information chunks. Since in such a case, the system may have to use extra linguistic devices to show the user that he is going to continue the presentation of the travel plan.</Paragraph> </Section> class="xml-element"></Paper>