File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/94/p94-1007_metho.xml

Size: 19,837 bytes

Last Modified: 2025-10-06 14:13:55

<?xml version="1.0" standalone="yes"?>
<Paper uid="P94-1007">
  <Title>GENERATING PRECONDITION EXPRESSIONS IN INSTRUCTIONAL TEXT</Title>
  <Section position="4" start_page="0" end_page="42" type="metho">
    <SectionTitle>
CORPUS ANALYSIS
</SectionTitle>
    <Paragraph position="0"> The corpus developed for this study contains approximately 1000 clauses (6000 words) of instructions taken from 17 different sources, including instruction booklets, recipes, and auto-repair manuals. It contains 98 precondition expressions, where the notion of precondition has been taken from Rhetorical Structure Theory (Mann and Thompson, 1988): and in particular from RSsner and Stede's modified relation called Precondition (1992). This relation is a simple amalgam of the standard RST relations Circumstance and Condition and has proven useful in analyzing various kinds of conditions and circumstances that frequently arise in instructions.</Paragraph>
    <Paragraph position="1"> The analysis involves addressing two related issues:  1. Determining the range of expressional forms commonly used by instructional text writers; 2. Determining the precise comnnmicative context  in which each of these forms is used.</Paragraph>
    <Section position="1" start_page="42" end_page="42" type="sub_section">
      <SectionTitle>
Instructhmal
Text
</SectionTitle>
      <Paragraph position="0"> Determining the range of forms was a matter of cataloging the fl~rms that ~mcurred in the corpus. Example (1) shows exemplars of the major forms found, which include present tense action expressions (la), agentless passives (lb), relational expressions of resultant states (lc): phrasal forms, (ld), and separated iml,erative forms (le).</Paragraph>
      <Paragraph position="1"> Determining the functional context in whidl each ~,f the forms is used inw~lves identifying correlations between the contextual features of communicative context on the -he hand, and the lexical and grammatical form on the other. I focus here on the range of lexical and gralnmatical forms corresponding to the precondition expressions in the corpus. The analyst begins by identifying a feature of the communicative context that appears to correlate with the variation of some aspect of tlle lexical and grammatical forms. They then attempt to validate the hypothesis by referring to the examples in the corpus. These two phases are repeated until a good match is achieved or until a relevant hypothesis cannot be found.</Paragraph>
    </Section>
  </Section>
  <Section position="5" start_page="42" end_page="42" type="metho">
    <SectionTitle>
IMAGENE
</SectionTitle>
    <Paragraph position="0"> The analysis has resulted in a number of identified covariations which have been coded in the System Network formalism from Systemic-Functional Linguistics (Halliday, 1976) and included in the IMAGENE architecture. The system network is basically a derision network where each choice point distinguishes between alternate features of the communicative context. It has been used extensively in Systemic Linguistics to address both sentence-level and text-level issues. Such networks are traversed based on the appropriate features of the comnmnicative context, and as a side-effect, of this traversal, linguistic structures are constructed by realization ~statemcnts which are associated with each feature of the network. These statements allow several types of manipulation of the evolving text structure, including the insertion of text structure nodes, grammatical marking of the nodes, textual ordering, and clause combining. Currently, the network is traversed manually; the data structures and code necessary to automatically navigate the structure have not been implemented. This has allowed me to focus on the contextual distinctions that need to be made and on their lexical and grammatical consequences.</Paragraph>
    <Paragraph position="1"> The general architecture of IMAGENE, as depicted in Figure 1. consists of a System Network and a Sentence Building routine, and is built on top of the Penman text generation system (Mann, 1985). It transforms inputs (shown on the left) into instructional text (shown on the right).</Paragraph>
    <Paragraph position="2"> The following sections will detail the results of the analysis for precondition expressions. It should be noted that they will include intuitive motivations for the distinctions made in the system network. This is entirely motivational; the determinations made by the systems are based solely on the results of the corpus analysis.</Paragraph>
  </Section>
  <Section position="6" start_page="42" end_page="43" type="metho">
    <SectionTitle>
PRECONDITION SLOT
</SectionTitle>
    <Paragraph position="0"> In the corpus, preconditions are typically fronted, and therefore the sub-network devoted to precondition expression will default to fronting. There are four exceptions to this default which are illustrated here: (2a) The BATTER?*&amp;quot; LOW INDICATOR will light when the battery is the handset is low.</Paragraph>
    <Paragraph position="1"> (Excursion, 1989)  (2b) Return the OFF/STBY/TALK switch to STBY after your call. (Code-a-phone, 1989) (2c) The phone will ring only if the handset 'is on the base. (Code-a-phone, 1989) (2d) In the STBY (standby) position, the phone  will ring whether the handset .is on the base or in another location. (Code-a-phone, 1989) The slot selection fi~r example (2a) couht go either way; except that it is the first sentence in a section titled ':Battery Low Indicator&amp;quot;, making the discussion of this indicator the local topic of conversation, and thus the appropriate theme of the sentence. This distinction is made in the portion of the system network shown in figure 2. This sub-network has a single system which distinguishes between preconditions associated with actions referring to thematic material and those associated with non-thematic material. The realization statement, Nucleus&gt;Precond, indicates that the main action associated with the condition (called the nucleus in RST terminology) is to be placed before the precondition itself.</Paragraph>
    <Paragraph position="2"> The slot determinations for the remainder of example (2) are embedded in system networks shown later in this paper. Example (2b) is the example of what I call rhetorical demotion. The action is considered obvious and is thus demoted to phrase status and put at the end of its immediately following action. Examples (2c) and (2d) show preconditions that are not fronted because of the syntax used to express the logical nature of the precondition. In (2c), the condition is expressed as an exclusive condition which is never fronted. One could, perhaps; say &amp;quot;?? Only if the handset is on the base, will the phone ring.&amp;quot; 2 but this form is never used in the corpus. Neither is the condition form in (2d) ever fronted in the corpus.</Paragraph>
    <Paragraph position="3"> 2The &amp;quot;??&amp;quot; notation is used to denote a possible form of expression that is not typically found in the corpus; it does not indicate ungrammaticMity.</Paragraph>
  </Section>
  <Section position="7" start_page="43" end_page="44" type="metho">
    <SectionTitle>
PRECONDITION LINKER
</SectionTitle>
    <Paragraph position="0"> Preconditions are marked with a number of linkers, illustrated in the IoUowing examples: (3a) Lift the handset and set the OFF/STBY/TALK \[8\] switch to TALK. When you hear dial tone, dial the number on the  Dialpad \[4\]. (Code-a-phone, 1989) (3b) If you have touch-tone service, move the TONE/PULSE SWITCH to the Tone position.</Paragraph>
    <Paragraph position="1"> (Excursion, 1989) (3c) I. Make sure the handset and base antennas are fully extended. 2. Set the OFF/STBY/TALK SWITCH to Talk.</Paragraph>
    <Paragraph position="2"> (Excursion, 1989)  The systems largely dedicated to selecting precondition linkers are shown in figure 3. 3 Two parallel systems are entered, Condition- null Condition-Probability distinguishes actions which are probable from those which are not. Highly probable actions are typicaUy marked with ':when&amp;quot;. Those actions which are not highly probably are marked with &amp;quot;If&amp;quot; or some similar linker, as determined by the Complexity system and its descendants.</Paragraph>
    <Paragraph position="3"> The Complexity system is entered for actions which are not probable and not changeable. It determines the logical nature of the preconditions and sets the linker accordingly. The three possible linkers chosen by this sld:)-network are ':if&amp;quot;; &amp;quot;only if&amp;quot;, or &amp;quot;whether... or... &amp;quot;. Precond-When is entered when the action is conditional and further is highly probable. The occurrence of the dial tone in example (3a) is part of a sequence of actions and is conditional in that it nlay not actually happen, say if the telephone system is malflmctioning in some way, but is ntmetheless highly probable. Precond-Nominal is entered immediately after Precond-When whenever the precondition is being stated as a nominalization. It overwrites the linker choice with ':after&amp;quot; in only this case.</Paragraph>
    <Paragraph position="4"> Preconditions that the user is expected to be able to change if necessary and which come at the beginning of sections that contain sequences of prescribed actions are called Change.able preconditions. Example (3c) is such a case. Here, the reader is expected to check the antennas and extend them if they are not already extended. This 3In the figure, the bold-italic con(htious attached to the front of these systems denote conditions that hold on entry (e.g., ConditionM-Action is a condition trite on the entry of Condition-Probability), They axe necessary because the networks shown are only portions of a much larger network.</Paragraph>
    <Paragraph position="5">  type of precondition is marked as a &amp;quot;Make sure&amp;quot; imperative clause by Changeable-Type.</Paragraph>
  </Section>
  <Section position="8" start_page="44" end_page="46" type="metho">
    <SectionTitle>
PRECONDITION FORM
</SectionTitle>
    <Paragraph position="0"> As noted above, preconditions can be expressed as either a terminating condition or as an action.</Paragraph>
    <Paragraph position="1"> The choice between the two is made by the form selection sub-networks: shown in figures 4 and 5.</Paragraph>
    <Paragraph position="2"> This choice depends largely upon the type of action on which the precondition is based. The actions in the corpus can be divided into five categories which affect the grammatical form of pre- null The first four actions are special categories of actions that have varying act and terminating condition forms of expression. The last category, other actions: encompasses all actions not falling into the previous four categories. The sub-network which distinguishes these forms is shown in figure 4. This section wiU discuss each category in turn: starting with the following examples of the first four action types:  (4a) Listen for dial tone, then dial AREA CODE + NUMBER slowly (Airfone, 1991) (4b) If you have touch-tone service: move the TONE/PULSE SWITCH to the Tone position.</Paragraph>
    <Paragraph position="3"> (Excursion, 1989) (4c) The phone will ring only if the handset is on the base. (Code-a-phone, 1989) (4d) /f you leave the OFF/STBY/TALK \[8\]  switch in TALK: move the switch to PULSE: and tap FLASH \[6\] the next time you lift the handset; to return to PULSE dialing mode.</Paragraph>
    <Paragraph position="4"> (Code-a-phone, 1989) Monitor actions, as shown in example (4a), concern explicit commands to monitor conditions in the environment. In this case, readers are being commanded to listen for a dial tone: with the underlying assumption that they will not continue on  with the instructions unless one is heard. Giving and Placing actions, however, tend to be expressed as terminating conditions, as shown in (4b) and (4c). The corpus does not include active forms of these actions: such as &amp;quot;?? If the phone company has given you touch-tone service, do ...&amp;quot; or &amp;quot;?? Do ... if you have placed the handset on the base.&amp;quot; An Habitual decision is a decision to make a practice of performing some action or of performing an action in some way. When stated as preconditions, they take the present tense form in (4d). Taken in context, this expression refers not to a singular action of leaving the OFF/STBY/-TALK switch in TALK position; but rather to the decision to habitually leave it in such a state. The singular event would be expressed as &amp;quot;If you have left the OFF/STBY/TALK switch in TALK, do ...&amp;quot; which means something quite different from the expression in (4d) which is stated in present tense.</Paragraph>
    <Paragraph position="5"> The bulk of the preconditions in the corpus (70.4%) are based on other types of actions. These types are distinguished in figure 5. In general, the Other Effective Action systems are based on the actor of the action. Reader actions are expressed either as present tense passives or as present tense actions, depending upon whether the action has been mentioned before or not. These distinctions are made by the gates Repeated-Reader and Not-Repeated-Reader. An example of the former can be found in (5a), (':When the 7010 is installed&amp;quot;). In the corpus, such expressions of actions already detailed in the previous text take the present tense, agentless passive form. If the reader action is not a repeated mention, a simple present tense active form is used, as in example (5b).</Paragraph>
    <Paragraph position="6"> (5a) When the 7010 is installed and the battery has charged for twelve hours, move the OFF/STBY/TALK \[8\] switch to STBY.</Paragraph>
    <Paragraph position="7"> (Code-a-phone, 1989) (5b) /f you make a dialing error, or want to make another call immediately, FLASH gives you new dial tone without moving the OFF/STBY/TALK switch. (Code-a-phone, 1989) The Act-Hide system and its descendants are entered for non-obvious, non-reader actions. There are four basic forms for these precondition expressions, examples of which are shown here:  (6a) If light flashes red, insert credit card again (Aiffone, 1991) (6b) When you hear dial tone, dial the number on the Dialpad \[4\]. (Code-a-phone, 1989) (6c) The BATTERY LOW INDICATOR will  light when the battery in the handset is low.</Paragraph>
    <Paragraph position="8"> (Excursion, 1989) (6d) When instructed (approx. 10 sec.) remove phone by firmly grasping top of handset and pulling out. (Airfone, 1991) Act-Hide distinguishes actions which are overly complex or long duration and those that are not. Those which are not will be expressed either as present tense actions, as the one in example (6a), if the action form is available in the lexico-grammar. Active-Available makes this determination. If no action form is available, then Inception-Status is entered. If the inception of the action is expected to have been witnessed by the reader, then the present tense sensing action form is used, as shown in example (6b).</Paragraph>
    <Paragraph position="9"> Termination-Availability is entered either if the action is to be hidden or if the inception of the action was not expected to be experienced by the reader. In these cases, the relational form of the terminating condition is used if it is available. An example of this is shown in example (6c). The long duration action of the battery draining is not expressed in the relational form used there. If the relational form is not available, the present tense, agentless passive is specified, as shown in example (6d).</Paragraph>
    <Paragraph position="10"> Finally, if an action being expressed as a pre-condition is considered obvious to the reader, the nominalization is used, provided its nominalized form is available in the lexicon. Example (ld) is an example of such an expression.</Paragraph>
  </Section>
  <Section position="9" start_page="46" end_page="48" type="metho">
    <SectionTitle>
VERIFYING IMAGENE'S
PRESCRIPTIONS
</SectionTitle>
    <Paragraph position="0"> This study has been based primarily on an analysis of a small subset of the fitll corpus: namely on the instructions for a set of three cordless telephone manuals. This training set constitutes appro~mately 35% of the 1000 clause corpus. The results of this analysis were implemented in IMAGENE and tested by manually re-running the system network for all of the precondition expressions in the training set. These tests were performed without the Penman realization component engaged: comparing the text structure output by the system network with the structure inherent in the corpus text. A sample of such a text structure: showing IMAGENE:s output when run on the actions expressed in the text in example (7)., is shown in figure 6. The general structure of this figure is reflective of the underlying RST structure of the text.</Paragraph>
    <Paragraph position="1"> The nodes of the structure are fitrther marked with all the lexical and grammatical information relevant to the issues addressed here.</Paragraph>
    <Paragraph position="2"> (7) Wh, en the 7010 i.~ installed and the battery has charged for twelve hours; move the OFF/STBY/TALK \[8\] switch to STBY. The 7010 is now ready to use. Fully extend the base antenna \[12\]. Extend the handset antenna \[1\] for telephone conversations. (Code-a-phone, 1989) Statistics were kept on how well IMAGENE:s text structure output matched the expressions in the corpus with respect to the four lexical and grammatical issues considered here (i.e.: slot: form; linker: and clause combining). In the example structure, all of the action expressions are specified correctly except for the Charge action (the second clause). This action is marked as a present tense passive, and occurs in the corpus in present perfect form.</Paragraph>
    <Paragraph position="3"> In fi|ll realization mode: IMAGENE translates the text structure into sentence generation commands for the Penman generation system: producing the following output for example (7): (8) PiOten the phone is installed, and the battery is charged, move the OFF/STBY/TALK switch to the STBY position. The phone is now ~eady to use. Extend the base antenna. Extend the handset antenna for phone convez:~ation.</Paragraph>
    <Paragraph position="4"> As just mentioned, this text identical to the original with respect to the four lexical and grammatical issues addressed in the corpus study with  the exception of the second clause. There are other differences: however; having to do with issues not addressed in the study; such as referring expressions and the expression of manner. A corpus study of these issues is yet to he performed.</Paragraph>
    <Paragraph position="5"> The overall results are shown in table 7 (see Vander Linden, 1993b for the results concerning other rhetorical relations). This (:hart indicates the percentage of the prec.ndition examples for which IMAGENE:s predic:tions matched the c(~rpus for each of the four lexical and grammatical issues considered. The values for the training and testing sets are differentiated. The training set results indicate that there are patterns of expression in cordless telephone manuals that can he identified and implemented.</Paragraph>
    <Paragraph position="6"> The system's predictions were als. tested on a separate and m(~re diverse portion ,,f the corpus which includes instructions for different types of devices and processes. This additional testing serves both to disallow over-fitting of the data in the training portion: and to give a measure of how far beyond the telephone domain the predictions can legitimately he applied. As (::an be seen in figure 7; the testing set results were not as good as those for the training set. hut were still well above random guesses.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML