File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/93/p93-1031_metho.xml

Size: 22,852 bytes

Last Modified: 2025-10-06 14:13:30

<?xml version="1.0" standalone="yes"?>
<Paper uid="P93-1031">
  <Title>TAILORING LEXICAL CHOICE TO THE USER'S VOCABULARY IN MULTIMEDIA EXPLANATION GENERATION</Title>
  <Section position="3" start_page="0" end_page="228" type="metho">
    <SectionTitle>
1. Introduction
</SectionTitle>
    <Paragraph position="0"> A language generation system should select words that its user knows. While this would seem to involve simply selecting a known word instead of an unknown word (as is done, for example, in \[1\]), in many cases it requires entirely rephrasing the rest of the sentence. For example, in our domain of equipment maintenance and repair, if the user does not know the word &amp;quot;polarity,&amp;quot; a sentence like &amp;quot;Check the polarity.&amp;quot; will be rephrased as &amp;quot;Make sure the plus on the batte~,lines up with the plus on the battery compartment. Even when alternative words can be used-instead of an unknown word (e.g., a descriptive expression can be used instead of an object name), the alternative phrase may interact with other parts of the sentence which then need to be reworded as well.</Paragraph>
    <Paragraph position="1"> In this paper, we discuss the different strategies used in COMET for selecting words with which the user is familiar. Since COMET integrates text and pictures in a single explanation 1, unknown words are frequently disambiguated through accompan, ying. pictures. For example, when the accompanying picture clearly shows the object and its location, COMET will use the most common object name even if the user is unfamiliar with the name 2. When pictures cannot be used to disambiguate a word-or phrase, COMET has four strategms for avoiding unknown words: 1. Selecting an alternative word or phrase (e.g., generating &amp;quot;some number&amp;quot; instead of &amp;quot;arbitrary number' ') 2. Rephrasing by providing conceptual definitions (e.g., generating &amp;quot;Make sure the plus on the battery lines up with the plus on the battery compartment.&amp;quot; instead of &amp;quot;Check the polarity&amp;quot;)  3. Rephrasing by generating descriptive referring expressions (e.g., generating &amp;quot;the cable that runs to the KY57&amp;quot; instead of &amp;quot;the COMSEC cable' ') 4. Using past discourse to construct a  referring expression (e.g., generating &amp;quot;Test the cable you just removed.&amp;quot; instead of &amp;quot;Test the COMSEC cable.&amp;quot; if the user had previously been instructed to remove this cable.) In the following sections, we first t?rov!de an overview of lexical choice in COMET, snowing how and where it occurs in the overall system. Each of the strategies is then described in turn, prefaced by a brief discussion of disambiguation of unknown terms through pictures. Finally, we compare our work with previous work in the area.</Paragraph>
    <Paragraph position="2">  COMET's architecture is shown in Figure 1. On receiving a request for an explanation via a menu interface, the content planner uses schemas \[6\] to determine which information should be included in the explanation from the underlying knowledge sources. The explanation content, represented as a hierarchy of logical forms (LFs) \[7\] is passed to the media coordinator \[3, 8\], which adds annotations indicating which portions are to be produced by the text generator and which by the graphics generator \[9\]. The Lexical Chooser is part of the text generator \[7\]. Typically, it selects a word or phrase for each semantic concept in the input LF (i.e., the semantic constraints on word choice). In terms of coverage, the implementation can select words for 148 different semantic concepts using 253 mapping rules, thus yielding on average slightly less than two alternative word choices per concept (there are many concepts which are mapped to a single word, while others have more than two alternatives). The lexicon contains 159 open class words.</Paragraph>
    <Paragraph position="3"> In this paper, we show how the user model and past discourse (pragmatic constraints) also influence word choice. But these are not the only constraints on word choice. Syntactic form of the sentence and lexical constraints are other demonstrated \[10, 11\] influences on lexical choice. For example, once the verb has been chosen, syntactic constraints on its arguments (e.g., whether the object is a clause,  an adj, or np) will influence what words are chosen to realize the semantic concept that fill these arguments. Conversely, if one of the verb roles can only be realized as a noun phrase, for example, and not as other syntactic categories, this restricts which verb is selected. Lexical constraints on word choice arise from the use of collocations [12]. For example, a verb like &amp;quot;stand&amp;quot; takes the preposition &amp;quot;on&amp;quot; for its loca-tion role, while the verb &amp;quot;turn&amp;quot; takes the preposition &amp;quot;onto.&amp;quot; Lexical choice is thus influenced by a wide variety of constraints which interact in many ways.</Paragraph>
    <Paragraph position="4"> Since syntactic and lexical constraints are only available within the text generator, lexical choice is delayed until this point. Thus COMET waits until a variety of semantic, pragmatic, syntactic and lexical constraints are accumulated before selecting words.</Paragraph>
    <Paragraph position="5"> This means that COMET can use syntactic and lexical constraints on word choice in conjunction with semantic and graphical constraints provided as input, plus the new. pragmatic constraints we present. Previous work addressing pragmatic constraints on word usage folded lexical choice into the content planner (e.g., [13], [1]). This was possible since the work focused primarily on lexical side effects of content determination (e.g., what property to include in a reference as opposed to what linguistic form to use for a property). Such approaches do not allow a system to take syntactic and lexical constraints on word choice into account.</Paragraph>
    <Paragraph position="6"> On receiving the hierarchy of logical forms, the Lexical Chooser determines the overall grammatical form of each sentence based on the semantic structure of the LFs (e.g., conditional sentences are generated for precondition-action structures) and selects the words and phrases realizing semantic concepts of the LF. It aSSeS a specification of the sentence's grammatical rm and open-class words to the general purpose surface sentence generator FUF [14, 15, 16]. The Lexical Chooser uses a rewriting system itself implemented on top of FUF. Its lexicon consists of a base of rules, where each rule rewrites a given set of semantic features into a corresponding set of lexical and syntactic features. Thus, each lexicon entry associates a semantic concept with words that can be used to realize it. Additional constraints from the user model, past discourse, and the underlying knowledge base determine which of the alternative words or phrases should be selected. 3 The user model indicates both the reading level of the current user 4, any individual words that COMET knows the user does not understand, and any wording preferences (e.g., the user knows abbreviations, the user is familiar with military terminology). We make no claims about which of these forms of user models is easier to acquire, but simply show how to use them when available. null If none of the alternative wordings for a given semantic concept of the LF are known to the user and the  Install the new holding battery. Step 2 of 6 Remove the old holding battery, shown in the cutaway view. Figure 3: Use of Cross References: Remove the holding battery, shown in the cutaway view accompanying illustration cannot disambiguate these words, COMET reinvokes the content planner to replan portions of the sentence content or to include additional semantic information. Thus, COMET's architecture interleaves lexical choice and content planning in order to account for a wide variety of mteracting constraints on word choice.</Paragraph>
  </Section>
  <Section position="4" start_page="228" end_page="229" type="metho">
    <SectionTitle>
3. Multimedia Disambiguation
</SectionTitle>
    <Paragraph position="0"> An accompanying picture often makes clear what the referent of a referring expression is. If the user is unfamiliar with a term, the accompanying picture might define it. For example, Figure 2 shows one step of an explanation generated by COMET for loading frequency into the radio. The text refers to a &amp;quot;FCTN knob ' and the accompanying picture clearly singles out the knob on the front panel of the radio [4]. COMET can also generate an explicit reference to the illustration itself (called a cross reference). For example, the cross reference shown in Figure 3 is generated if the user does not understand the term &amp;quot;holding battery&amp;quot;. In this case, the Lexical Chooser, on determining that &amp;quot;holding battery&amp;quot; is an unfamiliar term, reinvokes the content planner which finds that no accompanying illustration is currently planned and invokes graphics to generate an accompanying illustration that depicts the holdin~ battery and its location. For full details on cross reierencing in COMET see [ 18].</Paragraph>
    <Paragraph position="1">  strate alternative wordings. The first italicized phrase is generated if the user's vocabulary level is above a certain reading level or if a word is not explicitly listed in the user model as unknown. Since the lexicon maintains a simple association between the semantic concept and alternative phrasings, COMET selects the first alternative which the user model indicates is familiar to the user. For example, Figure 5 shows that for any concept under the concept c-disconnect in the knowledge base taxonomy, COMET will use the word &amp;quot;disconnect&amp;quot; if the user's vocabulary level is high and the word &amp;quot;remove&amp;quot; otherwise. COMET also checks whether the user knows abbreviations and if so, will use a referring expression such as &amp;quot;FCTN knob&amp;quot; as shown m  tion knob&amp;quot;). If COMET has no information about the user, it generates the abbreviation and relies on the accompanying illustration to clarify the referent.</Paragraph>
    <Paragraph position="2">  1. Screw the new manpack antenna onto the RT and tighten until the manpack antenna is snug/tight.</Paragraph>
    <Paragraph position="3"> 2. Disconnect/Remove the COMSEC cable from the KY57 audio connector.</Paragraph>
    <Paragraph position="4"> 3. This will cause the display to show an arbitrary/some number.</Paragraph>
  </Section>
  <Section position="5" start_page="229" end_page="231" type="metho">
    <SectionTitle>
5. Rephrasing through Replanning
</SectionTitle>
    <Paragraph position="0"> Selecting an alternative wording for a semantic concept is not always possible since none of the alternatives may be known by the user. Instead, COMET can describe concepts at a more detailed semantic level of abstraction by retrieving additional definitional information from the knowledge base and it can create referring descriptions when object names are not known, by retrieving object attributes.</Paragraph>
    <Paragraph position="1"> 5.1. Retrieving alternative concept definitions Sometimes the original text uses a word or phrase that abstracts the details of a concept to allow generation of a very concise expression. If unfamiliar with the word or phrase, the user will be unable to infer the specifics needed to perform the task. Alternative wordings require choosing a less abstract level of semantic decomposition at which to describe the concept. In these cases, COMET's lexical chooser reinvokes the content planner to retrieve a finer grained definition of the concept from the knowledge base.</Paragraph>
    <Paragraph position="2"> For example, this strategy is used for rephrasing the request &amp;quot;Check the polarity&amp;quot; which COMET issues when providing instructions for installing a new holding battery. More detailed semantics of checking the polarity are stored as different tokens of the concept c-polarity in the knowledge base. 5 For example, in Figure 6 polarity is represented as the ecjuivalence between the two plusses on two batteries deg. Now, if the plan calls for checking polarity, it can be represented In terms of a checking action on the equivalence of these two plusses (i.e., that they line up). If the user is unfamiliar with the word &amp;quot;polarity,&amp;quot; an alternate decomposition will be retrieved and replace the phenomenon role filler in the original LF (Figure 7).</Paragraph>
    <Paragraph position="3"> Figure 8 shows the alternative LF with a new phenomenon role (the remainder of the LF is unchanged). The resulting rephrased sentence is &amp;quot;Make sure that the plus on the battery lines up with the plus on the battery compartment.. &amp;quot;Lines up' is selected in the lexicon for the equivalence relatlon based on the semantics of its roles (i.e., that they are both plusses on the batteries). Here semantic selectional restrictions on the.roles control lexical choice of the verb.</Paragraph>
    <Paragraph position="4"> Since the object of the new sentence is an embedded sentence, COMET can use either the verb &amp;quot;check&amp;quot; or the collocation &amp;quot;make sure&amp;quot; as the verb realizing the mental process concept c-check. Note that, while these two verbs are listed as alternatives in the lexicon for c-cheek, &amp;quot;make sure&amp;quot; cannot be used in the original sentence due to a syntactic constraint: its object cannot be an NP as one cannot say &amp;quot;Make sure the polarity.. This is an example of interaction between syntactic and pragmatic constraints. Since syntax does not constrain the choice of verb in the modified sentence, COMET arbitrarily selects &amp;quot;make sure' '.</Paragraph>
    <Paragraph position="5"> The lexicon entry containing these two verbs is shown below in Figure 9. Note that the entry is indexed by the semantic concept c-check. There are two alternative verbs, only one of which is compatible with a clause as phenomenon role (ultimately the object). When the phenomenon is an NP, both verbs are valid and one is randomly selected.</Paragraph>
    <Paragraph position="6">  ; represented as two plusses which should ; be equivalent. The roles of the ec/uative ; relation are identified and identifier</Paragraph>
    <Paragraph position="8"> with c-check since in our domain checking is carried out on many different objects, while few actions are carried out on polarity.</Paragraph>
    <Paragraph position="9"> 6The equative relations has two roles, identified and identifier. Since they are included here, the equative relation (i.e., that the two plusses &amp;quot;line up&amp;quot;) is inferred to hold.</Paragraph>
    <Section position="1" start_page="230" end_page="231" type="sub_section">
      <SectionTitle>
5.2. Generating New Referential Descriptions
</SectionTitle>
      <Paragraph position="0"> If the user does not know an object name, the content ~ lanner is reinvoked to generate object attributes to uild a referential description. Although our selection algorithm is not as sophisticated as others \[19, 5, 13\] because we do not use a detailed model of user beliefs, we address a new issue: the interaction between the new description and other parts of the original sentence which may require rephrasing. Two types of object attributes are used in a referring expression in COMET: object subpart relations and atial relations to other objects in the accompanying stration. COMET selects the relations that uniquely identify the object.</Paragraph>
      <Paragraph position="1"> For example, suppose COMET's Lexical Chooser is provided with the LF for sentence 1, Figure 10, but the user does not know the term &amp;quot;COMSEC.&amp;quot; Instead of generating sentence 1, COMET generates sentence 2. To do this, COMET first selects a unique relation between the cable and a known object. In this case, it selects the connects spatial relation between the Radio Transmitter (RT) and the KY57, since this cable is the only one that connects the radio and the KY57. Selecting this relation for the description and substituting it for 'the COMSEC cable would result in sentence 3, Fig. 10. However, COMET notes the redundant references to the audio connector and removes one from the cable modifier by selecting the verb &amp;quot;runs to&amp;quot; instead which only requires one role in the generated sentence. This would result in the sentence 4, Fig. 10. In this sentence, the attachment of the prepositional phrase &amp;quot;from the KY57 audio connector is ambiguous. COMET detects this ambiguity when it removes the first from-location; since the two from-locations would have occurred side by side and both previous verbs of the sentence take it as a modifier, the generator must clarify that it is the from-location of the earlier verb &amp;quot;disconnect&amp;quot; and not &amp;quot;run to.&amp;quot; To remove ambiguity, COMET surrounds the modifier of the cable by commas in sentence 2, Fig. 107.</Paragraph>
      <Paragraph position="2"> Descriptions Generated by COMET:  I. &amp;quot;Disconnect the COMSEC cable from the KY57 audio connector.&amp;quot; 2. &amp;quot;Disconnect the cable, which runs to the RT, from the KY57 audio connector.&amp;quot; Descriptions Avoided by COMET: 3. &amp;quot;Disconnect the cable that connects the RT to the KY57 audio connector from the KY57 audio connector.&amp;quot; 4. &amp;quot;Disconnect the cable that runs to the RT from the KY57 audio connector.&amp;quot;</Paragraph>
    </Section>
  </Section>
  <Section position="6" start_page="231" end_page="232" type="metho">
    <SectionTitle>
6. Using Past Discourse
</SectionTitle>
    <Paragraph position="0"> For subsequent reference, the presence of a discursive context allows for a wider variety of strategies to get around gaps in the user's vocabulary. COMET takes advantage of this fact by maintaining a discourse history, The content planner records all descriptions into the discourse history, creating one record for the description as a whole and a separate record for each of its roles. The entry for the description has four fields:  in that action (e.g., &amp;quot;COMSEC cable&amp;quot; is the medium of the action &amp;quot;disconnect&amp;quot;). null For each subsequent reference, the concept name is used as the access key and the three other fields are updated; they thus always contain the information on the last reference. By looking up information in the discourse history, the content planner is able to construct object descriptions in terms of the last action it was involved in.</Paragraph>
    <Paragraph position="1"> Sentences generated if the user knows &amp;quot;COMSEC&amp;quot;  1. &amp;quot;Disconnect the COMSEC cable from the KY57 audio-connector.&amp;quot; 2. &amp;quot;Plug in the handset to the KY57 audioconnector.' ' 3. &amp;quot;Test the COMSEC cable.&amp;quot; Sentences generated if not: 4. &amp;quot;Disconnect the cable, which runs to the RT, from the KY57 audio connector.&amp;quot; 5. &amp;quot;Plug in the handset to the KY57 audio connector.&amp;quot; 6. &amp;quot;Test the cable that you just disconnected.&amp;quot;  As an example, consider the explanations COMET enerates when instructing the user how to diagnose ss of side tone. When the user has no vocabulary gaps, COMET .generates sentences 1-3, Figure 1 l. When the user is unfamiliar with the term &amp;quot;COM-SEC,&amp;quot; sentences 4-6 are generated instead. Here COMET uses past discourse to produce a descriptive reference for the second reference to the COMSEC cable.</Paragraph>
    <Paragraph position="2"> As in the previous examples, the gap is detected when the Lexical Chooser checks the user model. Since there is no alternative phrase for &amp;quot;COMSEC&amp;quot; in the lexicon, COMET calls the content planner to replan the reference. Since it is not the first reference to the cable, COMET uses the discourse history to plan a modifying description. A reference to the cable ts discovered in the history (its entry is shown in Figure 12) and the action in this entry is selected as the modifier to build a referring expression. 8 The role of the cable was medium and thus, COMET can generate the modifier as a relative clause. The LF for this referring expression is shown in Figure 13. This LF is sent back to the lexical chooser, which selects the words for the concepts within it, and continues with generation where it left off. On third and fourth reference to the same concept, COMET uses its anaphoric reference facility to generate either a bare head (e.g., &amp;quot;cable&amp;quot;) or a pronoun (e.g., &amp;quot;it' ').  COMET performs several lexical choice tasks. It can choose between alternative words or phrases for any part of speech. When generating a request to perform an action, it chooses a level of detail in the concept description appropriate to the user. When generating both initial and subsequent referring expressions, it selects a set of distinguishing properties of the referent and chooses words to express the selected  properties, Finally, for subsequent references, COMET can use previous discourse to avoid unknown words.</Paragraph>
    <Paragraph position="3"> COMET is thus using constraints from the user model, the accompanying illustration, and past discourse in addition to traditional constraints from semantics, syntax, and other word choices. Although other generation systems take into account some of these constraints, COMET is the first attempt to integrate such a variety of constraints and lexical choice strategies in a single system. In addition, because COMET is a multimedia system, it can use the accompanying illustrations advantageously for disambiguation. null WIP \[20\] can also generate cross references but does not rely. on a user model for either cross reference eneratlon or lexical choice. EPICURE \[19\], KAMP 5\], and FN \[13\] tailor references based on situation, but they do not constrain this choice based on the user's lexical knowledge. EPICURE uses the user's domain knowledge, KAMP mutual beliefs about the domain, and FN the user's domain knowledge in conjunction with rules on implicatures. They focus on the selection of appropriate properties to distinguish an object in generating references but do not choose between alternative wordings for the selected properties. None of these systems reword action descriptions or use past discourse to avoid terms the user does not know. While Bateman and Paris' system \[21\] uses different dialects depending on which class of users it is addressing through register mappings, in COMET different terms can be mixed and matched depending on the individual user model.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML