File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/88/j88-2004_metho.xml
Size: 70,054 bytes
Last Modified: 2025-10-06 14:12:11
<?xml version="1.0" standalone="yes"?> <Paper uid="J88-2004"> <Title>ASPECT, ASPECTUAL CLASS, AND THE TEMPORAL STRUCTURE OF NARRATIVE</Title> <Section position="3" start_page="0" end_page="0" type="metho"> <SectionTitle> 2 VARIETIES OF TEMPORAL KNOWLEDGE AND COMPOSITION OF H-TYPES </SectionTitle> <Paragraph position="0"> This and the next two sections summarize and elaborate on the proposals of Nakhimovsky 1986, 1987a. Commonscnse knowledge about time falls into three categories: compositional, durational, and aspectual. Compo30 Computational Linguistics, Volume 14, Number 2, June 1988 Alexander Nakhimovsky Aspect, Aspeetual Class, and the Temporal Structure of Narrative sitionally, h-type representations in the lexicon consist of the following components: a. Preparatory stage.</Paragraph> <Paragraph position="1"> b. Initial stage.</Paragraph> <Paragraph position="2"> c. Body.</Paragraph> <Paragraph position="3"> d. Final stage.</Paragraph> <Paragraph position="4"> e. Resulting stage.</Paragraph> <Paragraph position="5"> In Nakhimovsky 1986 only components b--e were recognized, but Moens and Steedman 1987 (also this volume) convincingly shows that a needs to be added, at least when the h-token is a planned human action. Components b--e are probably uncontroversial, except in certain cases, discussed below, where they are collapsed together. For example, consider the verb cross, as in cross the road. It describes movement from one side of a ribbonal surface to another, following a trajectory that is usually at approximately the right angle to the edge of the surface. The stages are: be located on one side, cross the edge, move across, cross the other edge, be located on the other side. So, as Steedman and Moens observe, the clause When the children crossed the road.., can be followed by any of the following: Example 1 a. They waited for the teacher to give a signal. b. They stepped onto its concrete surface as if it were about to open and swallow them.</Paragraph> <Paragraph position="6"> c. They were nearly hit by a car.</Paragraph> <Paragraph position="7"> d. They reached the other side stricken with fear. e. They found themselves surrounded by strangers. Moens and Steedman argue, correctly I think, that the different temporal relations between the when clause and the continuations do not result from the ambiguity, or polysemy, of the temporal connective: the when clause simply makes both the entire h-token and all its component stages available for focusing. What Example 1 does show is the weak aspectual typing of English verb forms: the same simple past form crossed makes all the stages of the h-type accessible. The most that can be said is that before some, but not all, of the continuations in Example 1, the simple past can be replaced by the progressive. In languages with strong aspectual typing an imperfective verb (form) would have to be used in the when clause before (lb) or (lc).</Paragraph> <Paragraph position="8"> This example points up what I believe to be the main function of aspect as a grammatical category: making either an h-type in its entirety, or (some subsequence of) its internal stages, available for focusing. In order to make the notion of focusing more precise I will use the notion of Reference Time (RT), adopted from Reichenbach 1947 but reinterpreted pragmatically: RT is to be thought of as a point or a very short interval that indicates where the current focus of attention is located. It may be obvious that after (la) RT is located in the middle of the preparatory stage, after (lb) in the beginning of the body right after the initial stage, after (lc) in the middle of the body, after (ld) in the resulting stage right after the final stage, and after (le) in the resulting stage. If, by the end of the sentence, the RT is positioned inside the body of the h-token described by the sentence I will say that the sentence has the imperfective aspectuai perspective; if the RT is positioned after the final stage or, more rarely, before the initial stage, I will say that the sentence has the perfective aspectual perspective. In Example 1, the initial when clause with a verb in the simple past leaves the aspectual sentence perspective underdetermined, which is what I meant by weak aspectual typing of English.</Paragraph> <Paragraph position="9"> Note that the RT never ends up in the middle of the initial or final stages, because they are too short, much shorter than the body; they belong to a different time scale. This does not mean that they are necessarily instantaneous in an absolute sense: they are simply shorter than the body by at least a constant factor, and in a different sentence they can still be presented imperfectively. Consider Example 2: Example 2 a. We spent the whole of last week working on Bob's project.</Paragraph> <Paragraph position="10"> bl. In the beginning...</Paragraph> <Paragraph position="11"> b2. As we began working on it...</Paragraph> <Paragraph position="12"> b3. When we were just beginning our work...</Paragraph> <Paragraph position="13"> The verb begin followed by an infinitive or gerund is a standard way of referring to the initial stage of a history. (Sometimes a lexical item is available as well or instead: &quot;to begin to be asleep&quot; is idiomatically expressed as fall asleep.) The second sentence of Example 2 thus puts the RT in the middle of the initial stage of the h-token described by the first sentence. Note that if in (2a) we replace the word week with month, year, decade, or night, the duration of the beginning varies with the duration of the whole while remaining much shorter than the body.</Paragraph> <Paragraph position="14"> The fact that we can so &quot;zoom in&quot; on different stages of a history has led many to conclude that there is no absolute sense in which events can be described as instantaneous. As Allen and Kautz 1985:253 puts it, &quot;we can always 'increase the magnification' and find more structure&quot; (see also Dowty 1986, Kamp 1979). I believe that instantaneousness is an absolute quality determined by our biology: instantaneous, events are those that are not perceived by humans as possessing internal structure. Languages select such events for special treatment by disallowing the &quot;imperfective description&quot; of them: one cannot use the imperfective aspect to place the RT in the middle of an instantaneous event, so that The light was flashing does not place the RT inside an individual flash. Nor can the begin plus gerund construction refer to the initial stage of an instantaneous event: compare He began crossing the street with He began blinking. The initial stage, the body and the final stage of an individual blink or flash or Computational Linguistics, Volume 14, Number 2, June 1988 31 Alexander Nakhimovsky Asl,ect, Aspectual Class, and the Temporal Structure of Narrative any other instantaneous event are thus inaccessible for future focusing, and it makes sense to represent them as all collapsed together. 2 Note that the class of instantaneous events can be defined from three different perspectives corresponding to the three varieties of temporal knowledge postulated in the beginning of this section. Durationally, instantaneous events belong to a time scale that is below a certain threshold of human perception of time (cf.</Paragraph> <Paragraph position="15"> Fraisse 1984:29-30). Compositionally, they have impoverished internal structure. Aspectually, this shows up in imperfective forms of verbs describing instantaneous events, which are interpreted differently than such forms of verbs describing noninstantaneous histories.</Paragraph> </Section> <Section position="4" start_page="0" end_page="0" type="metho"> <SectionTitle> 3 KNOWLEDGE ABOUT DURATIONS </SectionTitle> <Paragraph position="0"> Durations can be entered in the lexicon in the following three ways that are not mutually exclusive: a. For some h-types (e.g., lecture, shower, lunch) we know the duration of their h-tokens to be stable, albeit approximate. Such durations can be entered in the lexicon directly as a fuzzy number, e.g., lecture \[1 2 hour\] (cf. Allen and Kautz 1985). This is a simple and least interesting case.</Paragraph> <Paragraph position="1"> b. For most aspectual classes of histories, duration is constrained by qualitative functional dependencies (Forbus 1985) among the participants of the situation; so, the time it takes to read a text depends on its length and genre, and the proficiency of the reader. This case is discussed in Section 4, after the notion of aspectual class is introduced.</Paragraph> <Paragraph position="2"> c. For a majority of h-types, constraints on their durations can be expressed using the notion of time scale, introduced below.</Paragraph> <Paragraph position="3"> In order to measure durations one needs cyclical events. These may be celestial, biological, or manmade. (See Campbell 1986 for a lively discussion.) Time-measurement units provided by cyclical events form several interlocking, multitiered hierarchies. It is commonly assumed in AI literature that the choice of a measurement unit is arbitrary and independent of the interval to be measured, because we can always switch from one unit to another by using axioms of the sort hour = 3600 seconds. Mathematically, of course, this is true, but I don't think language quite works this way: when we say It took Bob an hour to repair the faucet, we don't mean that it took him 3600 seconds. Given an interval, we choose a unit to measure it with so that the duration is expressed by a &quot;reasonable&quot; number, that is, not too big. Small fractions are also avoided: we say half a month or a quarter of an hour, but not one-fifth, -sixth, and so on. (Note that a quarter is the smallest fraction for which there is a special word; the names for the rest of them are formed by a regular morphological process.) Under these (psychologically) very understandable restrictions it is rarely the case that an interval fit ~ \]precisely into the measurement unit we use. This results in a certain degree of fuzziness: when we say that something happened a year ago, we typically don't claim to be precise to the day. The same imprecision is observed when we use man-made time units such as hours minutes, and seconds. The sentences Joe slept for th,ee hours, or Joe slept from 2 to 5 do not claim precision to the minute; in fact, the second sentence can be continued with He woke up at a quarter after 5 feeling greatly refreshed, and the narrative does not become self-contradictory.</Paragraph> <Paragraph position="4"> These observations suggest that a duration expressed by a number in hours becomes expressed by a fuzzy number (an interval) in minutes: \[3 3 hour\] becomes something like \[160 200 minute\]. (In a richer theory, the degree of fuzziness would depend on what we know about the source of information.) Since the boundaries of a fuzzy interval are themselves fuzzy, it might seem that we ought to throw in some seconds at the ends of the \[2,50 300\] minutes interval. However, the duration would then be expressed in thousands of seconds, and commonsense reasoning is too qualitative to deal with such big numbers. The length of the interval suggests a measurement unit, which, in turn, establishes a certain scale, and durations that are much smaller than this scale are disregarded.</Paragraph> <Paragraph position="5"> It tiros makes sense to talk of units of measurement that a:'e &quot;natural&quot; for a given interval: measured in a natural unit, the length of the interval will not be a very small :fraction (greater than some constant R) or a very big nttmber (less than some constant N). I wouldn't want to commit myself to specific values for R and N, but psychologists seem to have a fairly clear idea of what they are (see, e.g., Fraisse 1984). The time scale of an interval can then be characterized by the sequence of its nat aral units, a subsequence of all the duration units. An interval's duration can be represented by fuzzy numbers, e.g., \[1 1 year\] &quot;a year, about a year&quot;; \[1.5 1.5 week\]. &quot;a week and a half&quot;; \[2 3 day\] &quot;two to three days&quot; and so forth. There are rules that transform fuzzy durations in one unit into fuzzy durations in a smaller unit, but only as long as both units are natural to the interval being measured: \[1.5 1.5 week\] becomes \[9 12 day\], but not \[200 260 hour\].</Paragraph> <Paragraph position="6"> To .zapture the intuition that hours are simply irrelevant when we talk about weeks, or days when we talk about years, I introduce the relation much-greater-than, which is readily defined in terms of natural units: i is much greater thanj if and only if (iff) the largest natural unit ofj is smaller than the smallest natural unit for i.</Paragraph> <Paragraph position="7"> There are thus two ordering relations defined on durations :neasured in the same scale: greater-than and much-greater-than; together, they give rise to an equi (indistlnguishability) relation (cf. Hobbs 1985, 1986b) defined as follows: two intervals' durations are equi iff the smaller of the two durations is much greater than the difference between them. Note that in this scheme of 32 Computational Linguistics, Volume 14, Number 2, June 1988 Alexander Nakhimovsky Aspect, Aspectual Class, and the Temporal Structure of Narrative things the equi relation is derived, rather than primitive as in Hobbs's work, and it is not incompatible with greater-than. I find it both convenient and intuitively correct that we can treat two intervals as equivalent without losing the knowledge, if we have it, that one is greater than the other.</Paragraph> <Paragraph position="8"> Note that the notion of natural units presented here is related to McDermott's (1982) lifetimes: saying that the natural unit for property p is year means that the property lasts (or persists) for years. It is also related to Hobbs's (1985a) grain. What I propose is that lifetimes and grains are monotonically related: the greater the lifetime, the greater the grain. Thus the same parameter, natural unit, can capture both.</Paragraph> <Paragraph position="9"> Duration units cut up a continuous parameter (time) into a finite set of quantities. The set of duration units, with the greater-than and much-greater-than relations on it, thus forms a quantity space for time, in the sense of Forbus 1985:96-7. We can communicate because our reasoning systems employ the same temporal quantity space, whose elements are arbitrary in precisely the same sense in which linguistic signs are arbitrary: each one of them could have been different, yet none of them can be changed by the individual speaker/reasoner, who acquires them in the process of cognitive and linguistic development. The same progression of ideas applies, with some modifications, to other one-dimensional scalars: distance, temperature, price, height/tallness, and speed.</Paragraph> <Paragraph position="10"> Given the role of time-measurement units in the proposed framework, the task of axiomatizing the clock and the calendar, mentioned in Hobbs 1985, becomes highly important. It involves identifying the time-measurement units, defining them in terms of cyclical events, and establishing the much-greater-than relation on them. There is some discussion of the task in Nakhimovsky 1986; here I can give just one illustration of the relevant empirical arguments. Apart from measurement units with single word names, there are multiples of other units, described by phrases, that have themselves become measurement units. Five minutes provides an example: we talk about 10 or 15 or 20 minutes much more commonly than 11 or 14 or 17 minutes, and when asked to estimate a duration that is less than an hour we typically use a multiple of five minutes. (Note that this is not true about hours or days.) If we don't, an unusual precision is implied; compare: Example 3 a. I'll be back in 15 minutes.</Paragraph> <Paragraph position="11"> b. I'll be back in 14 minutes.</Paragraph> <Paragraph position="12"> There's much more of a broken promise if I'm one minute late after (3b) than (3a), which indicates a finer grain (smaller scale) in (3b).</Paragraph> <Paragraph position="13"> Clearly, five minutes have become a unit because a minute is a unit, and we have five fingers on each hand.</Paragraph> <Paragraph position="14"> This is yet another example of the anthropocentrism pervading language and commonsense reasoning, the same antropo-centrism that was mentioned earlier in connection with instantaneous events.</Paragraph> </Section> <Section position="5" start_page="0" end_page="0" type="metho"> <SectionTitle> 4 ASPECT </SectionTitle> <Paragraph position="0"/> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 4.1 GRAMMATICAL ASPECT </SectionTitle> <Paragraph position="0"> In what follows it is essential to keep the following three concepts apart: aspect as a grammatical category of the verb, implemented by affixes, auxiliaries, and such; aspectual class, which is a characteristic of an h-type or lexical meaning; and the aspectual perspective of the sentence determined by the position of the RT with respect to the h-token described by a finite clause. Thus grammatical aspect is associated with linguistic expressions, aspectual class with h-types, and aspectual perspective with h-tokens. It is the aspectual perspective of the sentence that is ultimately important for understanding; both grammatical aspect and aspectual class sometimes uniquely determine, sometimes just strongly constrain, the aspectual perspective. In English, for instance, the progressive aspect guarantees that the sentence perspective is imperfective, except for sentences describing instantaneous events, which, in any language, cannot be presented imperfectively. (As the discussion in the end of Section 2 put it, He was blinking or He started blinking does not put the RT in the middle of an individual blink.) This does not mean that verbs like blink are grammatically perfective.</Paragraph> <Paragraph position="1"> There are languages, most notably Slavic, where the difference in sentence perspective is hard-wired into verb morphology: simplifying very slightly, every Russian verb is either perfective or imperfective (with the diagnostics being that only imperfective infinitives can follow verbs like begin or continue), and the morphological feature of the verb determines the aspectual perspective of the sentence. In English, by contrast, simple past is fairly noncommittal as to the aspectual perspective of the sentence, as we saw in the discussion of Example 1. Another way to describe this contrast is to say that Russian has a perfective-imperfective aspectual system, while English has an imperfectiveunmarked aspectual system; most aspectual systems seem to belong to one of these two kinds, with about three-to-one advantage for the perfective-imperfective systems in Dahl's (1985) analysis of 64 languages.</Paragraph> <Paragraph position="2"> A fairly common misconception is to assume that since English has an imperfective, it ought to have a perfective also. Obviously, simple past, without any further qualifications, cannot be considered perfective, in view of such obvious examples as Bob owned a house on Cape Cod. However, sentences like Vanessa read a magazine article about Mongolia do suggest the perfective perspective: by the end of the sentence the article seems to have been read (cf. Hinrichs 1986:68; Dowty 1986:46-8 for similar examples and discussion). I argue that it would be incorrect to call the verb or the verb phrase in such sentences perfective, because the perfective sentence perspective emerges not so much from Computational Linguistics, Volume 14, Number 2, June 1988 33 Alexander Nakhimovsky Asp ~ct, Aspectual Class, and the Temporal Structure of Narrative the linguistic properties of the verb phrase as from commonsense knowledge about the situation it describes: a short process that progresses towards a well-defined terminal point beyond which it cannot continue. To see that we are not dealing with a grammatical category but rather commonsense inference, observe that an appropriate context can change the sentence perspective, as in Example 4, where Vanessa's reading is interrupted by a doorbell: Example 4 After supper, Vanessa and Didi sat down in the living room. Vanessa read a magazine article about Mongolia, Didi watched her favorite cartoon on television. Suddenly the doorbell rang.</Paragraph> <Paragraph position="3"> I conclude, in accord, I believe, with Passonneau (1987 also this volume), that English has no morphological perfective; the perfective effect arises out of the aspectual properties of the history being described, i.e., its aspectual class.</Paragraph> </Section> <Section position="2" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 4.2 ASPECTUAL CLASSES OF HISTORIES </SectionTitle> <Paragraph position="0"> An aspectual classification divides lexical meanings according to the internal temporal structuring of the corresponding h-tokens. I believe they should be thought of as intrinsic properties of lexical meanings, independent of context and, in particular, aspectual perspective. Even out of context we know a good deal about how instances of, e.g., reading or sleeping or walking to the store are individuated and how they unfold in time. This knowledge interacts with, but is not determined by, whether the h-token is presented imperfectively, as in John was walking to the store.</Paragraph> <Paragraph position="1"> The major division among noninstantaneous histories, recognized at least since Aristotle, is between process (energeia) and state (stasis). In recent times, Vendler (1967) proposed a highly influential classification that is still commonly accepted, although the principles of classification have changed. Vendler believed, erroneously, that he was classifying English verbs, rather than their denotations, and he used such language-specific criteria as whether or not a verb has a progressive form (Vendler's statives, such as know, don't). In the Taylor-Dowty model-theoretical version (see, e.g., Dowty 1986), the classification is based on the relationship between the truth value of a sentence at an interval and at its subintervals. So, for instance, a sentence S is stative (denotes a state) iff it follows from the truth of S at an interval I that S is true at all subintervals of I (the so-called subinterval property, cf.</Paragraph> <Paragraph position="2"> Dowty 1986:42).</Paragraph> <Paragraph position="3"> I submit that these criteria cannot possibly capture the real distinctions operative in the workings of human language: these have to relate to something perceived and experienced, rather than truth values (which is not to deny that real distinctions may result in fairly consistent truth-functional properties). It is not accidental that Dowty's own example of a state (sleep) contradicts his definition: we can truthfully say that Bob slept from 10 to ~ even if he got up once to go to the bathroom. My propo:;al is that we take the physical vocabulary of proce.,ses and states seriously, and classify histories accorcLing to their internal dynamics, the stability of their parameters, and the resources they consume. (Part of the internal dynamics, in the presence of a conscious agent, is the degree of required attention and volitional control.) We can then note the distinction between states that do not require any resources to sustain themselves (knowing English, owning a house) and those :hat do (sleep requires a certain amount of sleepiness, which gradually wears out). The subinterval propel~ty holds only for zero-resource states and is, in fact, a simple consequence of their other properties: a state t aat requires no resources, including attention and conscious control, obtains continuously.</Paragraph> <Paragraph position="4"> An important group of resources are those that are consumed by the unconscious and periodic metabolic proce~,ses sustaining a living organism. I call them &quot;genetic resources&quot; because they are not specific to any particular history, but rather to all individuals of a given sort. Resource-consuming states all seem to require only generic internal resources, while within processes, there are those that require only generic resources (walking) and those that require process-specific resources as well: reading, for example, requires not only being awake and not too hungry, but also a text to read. This is the first of the two important divisicns among processes. The other one separates telic processes from atelic ones.</Paragraph> <Paragraph position="5"> Telic processes can be defined as processes that have a buill-in terminal point that is reached in the normal course of events and beyond which the process cannot continue. Two most common, and overlapping, varieties of relic processes are: human activities directed to a specific goal, and processes that consume a specific amount of a process-specific resource. So, reading a magazine article is a telic process not only because reading is a goal-directed activity, but also because an article contains a limited amount of the process-specific resource (reading matter) that gets steadily consumed as the process unfolds. Temporal information about a telic process thus includes a typical rate of progress towards its goal or a typical rate of consumption of its resource.</Paragraph> <Paragraph position="6"> A comparison of the preparatory and the resulting stages of a telic process shows the possible changes that such processes bring about: a telic process can create an object, destroy an object, modify an object, or move a specified amount of material (possibly the mover himself) to a specified destination. A subclass of destruction processes are ingestions, which convert an external resource into an internal one. Moving is understood to include all three of Schank's PTRANS, ATRANS, and MTRANS classes (Schank 1975). Moving also includes gradual (but not instantaneous) changes of state.</Paragraph> <Paragraph position="7"> It has been observed (in Verkyul 1972 and others after him) that the aspectual class of a telic predicate is 34 Computational Linguistics, Volume 14, Number 2, June 1988 Alexander Nakhimovsky Aspect, Aspectual Class, and the Temporal Structure of Narrative affected, in regular ways, by the properties of its noun-phrase (NP) complements, especially theme. Such distinctions as plural-singular, group-individual, and mass-count are particularly important, as well as the degree of granularity-homogeneity and internal structure. To take a well-known example, Bill crossed the border describes an instantaneous event; The regiment crossed the border describes a noninstantaneous h-token with a beginning, body, and end; and Drug smugglers crossed the border easily describes an activity that continues as long as there are drug smugglers. The aspectual properties of telic h-tokens thus arise compositionally in context; 3 however, it still makes sense to talk of the aspectual clas s of a telic h-type as it manifests itself in h-tokens with atomic individuals as themes.</Paragraph> </Section> <Section position="3" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 4.3 EVENTS, PROCESSES, AND STATES </SectionTitle> <Paragraph position="0"> Now I briefly comment on how the conceptual framework developed here relates to a classification of histories into events, processes, and states (Mourelatos 1981). For a linguist, the distinction event-process is one of aspectual perspective: &quot;The term 'process' means a dynamic situation viewed imperfectively, and the term 'event' means a dynamic situation viewed perfectively&quot; (Comrie 1976:51). The distinction process-state is one of aspectual class. Thus the sentence John was walking to the store describes a process, not a state, because walking in general, and walking to a specified destination in particular, is intrinsically a process; it describes a process and not an event because the h-token is viewed imperfectively. I think the lexicon can be made more consistent and informative if the notion of aspectual class (lexical aspect, Aktionsart) is kept separate from the aspectual sentence perspective.</Paragraph> <Paragraph position="1"> An important piece of evidence for keeping the distinctions event-process and process-state separate is provided by events &quot;made out of&quot; states such as the event described by Bobby took a nap. (Cf. also did some reading, went for a walk.) Collocations of this nature have never, to my knowledge, been discussed in connection with the English aspectual system, precisely because, in English, it is easy to associate telicity (an intrinsic lexical property) and perfectivity (the value of a subjective grammatical category). In languages with a well-articulated perfective aspect a stretch of most atelic activity can be viewed perfectively, i.e., as an event. For instance, He walked (perfective), clearly atelic, would mean something like &quot;He walked for a while/he took a walk&quot;--clearly an event. Atelic histories, of course, lend themselves more readily to the imperfective view; Nakhimovsky 1983:80-2 documents some subtle ways in which &quot;aspectual&quot; languages distinguish between atelic events, which are events simply by virtue of allocating them a limited amount of time, and telic events, which are intrinsically bounded in time by the very manner in which they unfold. Telic events tend to be identified as a natural class because they support perhaps the most common temporal inference we make: given a telic history viewed perfectively, we infer that the end point is reached. (This is made more precise in Section 4.5.) For all that, telicity and perfectivity remain distinct, if not entirely orthogonal, categories .4</Paragraph> </Section> <Section position="4" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 4.4 A CLASSIFICATION OF INSTANTANEOUS EVENTS </SectionTitle> <Paragraph position="0"> Unlike noninstantaneous histories, instantaneous events lack internal structure and have to be classified by comparing the world before and after them. An instantaneous event can terminate either a process or a state, and it can initiate either a process or a state; if it is sandwiched between two processes or two states, the two can be the same or different. The resulting classification is shown in Table 2, next page. It can be seen that in English, most verbs describing instantaneous events fall into the first three groups, where the instantaneous event meets a state. Sometimes the name of the process can be used to describe its beginning, as in She ran, i.e., &quot;set out running.&quot; Among the three groups of instantaneous events, happenings always have a progressive form, and they always describe a sequence of instantaneous h-tokens.</Paragraph> <Paragraph position="1"> Transitions may or may not have a progressive form; if they do, progressive is again associated with plurality, e.g., He's been noticing that... (the implication being that the noticing has taken place on many occasions.</Paragraph> <Paragraph position="2"> Observe the inference that between the occasions there is a return to the initial state of the transition: in order to again notice, i.e., become aware of something, one has to first cease to be aware of it.). Finally, culminations usually have progressive forms that allow two interpretations. One is the familiar idea of plurality: The travelers, one by one, were reaching the summit. The other arises from the nature of culminations, which crown processes with sharply defined end points: the top of a climb, the finish of a race. The progressive of a culmination verb describes an h-token whose lifetime is an open interval properly contained in the time of the process that leads to the instantaneous event. I conclude that the proposed classification, although based on extra-linguistic considerations, captures salient distinctions in the meaning of linguistic forms.</Paragraph> <Paragraph position="3"> All the classifications developed in this section are brought together in tables 1-3, next page.</Paragraph> </Section> </Section> <Section position="6" start_page="0" end_page="0" type="metho"> <SectionTitle> 4.5 ASPECTUAL CLASS, SENTENCE PERSPECTIVE, AND INFERENCE </SectionTitle> <Paragraph position="0"> Recall that the aspectual sentence perspective is determined by the position of the RT, either inside the time of an h-token or after that. In interaction with the knowledge about the internal composition of h-tokens, the position of the RT creates expectations, which can be stated using the notions of time scale and expended resources. Consider first an example: the h-token of Sharon reading a novel can terminate in one of the following four ways: state-E-same state happening flash, cough, jump state-E-different state transition forget, recognize, notice, process-E-state culmination win the race, reach the top process-E-same process disturbance examples state-E-process activation more difficult process-E-different process switch to find</Paragraph> </Section> <Section position="7" start_page="0" end_page="0" type="metho"> <SectionTitle> STATES PROCESSES II </SectionTitle> <Paragraph position="0"> atelie: atelic: I telic generic resource specific resource I natural end: Within a certain time scale, Sharon finished the novel; exhaustion: Within a certain time scale, determined by a different set of resources, Sharon got tired of reading; volition: Sharon decided to put the novel down; interruption: Something happened that made reading impossible: Sharon's friend took the novel away from her, the lights went out, her attention was distracted by loud music.</Paragraph> <Paragraph position="1"> Generalizing from this example, we can say that when the RT finds itself inside an h-token, some or all of the following expectations are set up, depending on the internal dynamics of the h-token and the presence or absence of volitional control: * If the h-token has an Agent participant, it can be interrupted by the Agent's decision to do so. (The decision may turn out to be impossible to implement but the h-token is then transformed into a different one, in which the former Agent perhaps becomes an Object or Patient: think of a driver trying to stop the car and going into a skid.) * If the h-token consumes generic resources (i.e., if it is a process or a resource consuming state) then, within a certain time scale, it comes to an end when the generic&quot; resource is exhausted.</Paragraph> <Paragraph position="2"> * If the h-token is a telic process, then, within a certain time scale, the process will come to its natural end. * Finally, and most difficult to detect, the h-token may come to an end if some of its preconditions no longer hold, causing an interruption of the h-token.</Paragraph> <Paragraph position="3"> The causality involved in interruption may be indirect, and a complete specification of everything that can go wrong is clearly impossible, but no such completeness is observed in human reasoning, either. What we can and ought to do is list in the lexicon the main preconditions for processes and states expressed by lexical items. Assuming that the lexicon is organized as an inheritance hierarchy, a great deal of preconditions will be shared by entire classes of histories. Some examples are given in Nakhimovsky 1987a.</Paragraph> <Paragraph position="4"> The most noticeable expectations are those arising when a telic process is viewed imperfectively, and the logical problems of reasoning with expectations were first observed on such processes. This is Dowty's (1977) &quot;imperfective paradox&quot;, i.e., the problem of giving &quot;an account of how \[John was drawing a circle\] entails that John was engaged in bringing-a-circle-into-existence activity but does not entail that he brought a circle into existence&quot; (p.46). From the perspective of the intervening years it can be seen that Dowty here grapples with the default, nonmonotonic nature of the inferences involved. The progressive sentence puts the RT in the middle of the time of John's activity; if another sentence puts the RT after that interval, we infer, unless or until told otherwise, that the activity came to its natural end. More precisely, the inference works as follows: Telic-lmperfective inference rule: If one sentence in a narrative tells us that a telic history hi has begun or is in progress, and a later sentence moves the RT into or beyond a later history h2, and the time distance between hl and h2 is of the same or a greater scale than the time scale ofhl, then assume, unless or until told otherwise, that the end-point of hl is reached.</Paragraph> <Paragraph position="5"> The rule can be illustrated by the following narrative: In the afternoon, when Johnny was writing a postcard to his friend, Mommy went to the market. In the evening they played together. Knowing how long it takes, we infer that Johnny finished his postcard and Mommy her shopping. However, the inferences are withdrawn if contradicted: the narrative could continue with until Mommy discovered that the postcard never got written.</Paragraph> <Paragraph position="6"> To see the importance of time scale, replace &quot;writing a postcard&quot; with &quot;reading War and Peace.&quot;</Paragraph> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 4.6 DURATIONAL CONSTRAINTS </SectionTitle> <Paragraph position="0"> The examples of the preceding section illustrate two facts about temporal reasoning that I believe are crucial for any implementation of narrative understanding: (a) knowledge about internal constituency of events and temporal relations between them is tightly integrated 36 Computational Linguistics, Volume 14, Number 2, June 1988 Alexander Nakhimovsky Aspect, Aspectual Class, and the Temporal Structure of Narrative with durational knowledge; (b) durational knowledge includes a great many constraints originating in celestial, biological, and civilization-specific cycles. The first of these facts is recognized and implemented in Allen and Kautz 1985. The implementation suggestions made here all derive from that paper and Forbus 1985. Allen and Kautz also note that &quot;in more complicated situations, certain units (such as the standard time units-minutes, hours, weeks, etc.) can be used as reference durations to keep the network of manageable size&quot; (p.</Paragraph> <Paragraph position="1"> 260). How this ties in with (b) above can be seen by recasting Allen and Kautz's example of a unified (temporal relations + durations) implementation in the present framework. The example in its entirety is as follows (pp. 260-1): Example 5 a. Moe and Larry began reading Principia Mathematica. null b. Moe read for over an hour.</Paragraph> <Paragraph position="2"> c. Larry stopped reading and fell asleep after 10 minutes.</Paragraph> <Paragraph position="3"> Allen and Kautz interpret (5a) to mean that Moe and Larry started reading simultaneously (and may or may not have finished together). But surely, language is not that precise: (5a) can be said about two students who both started reading the book in September for a course they were taking in the fall semester. The most one can say about (5a) is that the time distance between \[began-read Moe\] and \[began-read Larry\] is much-shorter-than the expected duration of reading. This is an instance of a general rule, noted in the end of Section 2, that the duration of an h-token is much-greater-than the durations of its beginning and end.</Paragraph> <Paragraph position="4"> Sentence (5b) supplies both a duration and a time scale. Allen and Kautz's representation again underconstrains: they interpret the sentence to mean that the duration of Moe's reading is (1 ~). An obvious constraint is that Moe read for less than a day, but more precisely &quot;over an hour&quot; implicates &quot;less than two hours&quot;; in general, &quot;over/after n units of time&quot; implicates &quot;less than n + 1 units.&quot; (This is an instance of Horn's (1984) scalar implicature.) Allen and Kautz, in fact, make an implicit use of this implicature when they interpret (5c) to mean that Larry read 10 to 15 minutes (p. 261). The interpretation is correct because 5 minutes is a duration-measurement unit in our language, as mentioned in Section 3 above.</Paragraph> <Paragraph position="5"> It appears that the varieties of temporal knowledge proposed in this paper result in a richer set of constraints that give a more adequate picture of (5) and can handle more complex narratives. They also indicate a fairly clear connection between the task of natural language understanding and the enterprise of naive or qualitative physics (Hayes 1985, 1985a; Forbus 1985).</Paragraph> <Paragraph position="6"> The entire aspectual classification of histories is based on taking seriously the physical vocabulary of processes and states, and the reasoning of the preceding section is an example of Forbus's (1985) limit analysis. In general, nonmonotonic constraint propagation over interacting processes evolving in time is the essence of naive physics. These similarities suggest that qualitative process descriptions of Forbus 1985 can serve as a model for lexical entries describing h-types. (Some examples can be found in Nakhimovsky 1987a.)</Paragraph> </Section> </Section> <Section position="8" start_page="0" end_page="0" type="metho"> <SectionTitle> 5 NARRATIVES AND THEIR REPRESENTATIONS </SectionTitle> <Paragraph position="0"> Discussions of tense and aspect in discourse invariably seem to converge on narrative discourse, rather than conversation or exposition. The reasons, I believe, have to do with several profound, if somewhat speculative, distinctions: between semantic and episodic memory; between on-line perception of time effected by biological clocks and detached time cognition rooted in long-term memory; and between subjective and objective discourse. Briefly and, I repeat, speculatively, exposition contributes to semantic memory while narrative builds an episodic representation; conversation is a breed apart because the time of a conversation coincides with the time of its content, with simultaneity in time typically accompanied by a tight integration between the linguistic and nonlinguistic behavior: the verbalization of how-to-get-there directions is the action of giving directions; task-oriented conversations between an expert and apprentice are an integral part of performing the task at hand, and the unfolding text of an argumentative dialogue is precisely the activity of arguing. Conversation can thus properly be called performatire discourse. By contrast, the content of a narrative is decoupled from the linear progression of its text and unfolds in its own, separate timeline. 5 Whatever the reasons, it is certainly the case that (a) tense, aspect, and mood systems of the verb are frequently more finely nuanced in the past than in the present or future, and never the other way around; (b) the tense of narrative discourse is past, in its many varieties; (c) the past tense in narratives is not interpreted with respect to the situation of discourse. In place of the situation of discourse, a narrative is processed with respect to a constantly maintained deictic center, which is &quot;the locus in conceptual space-time of the objects and events depicted or described by the sentences currently being perceived. At any point in the narrative, the cognitive agent's attention is focused on particular characters (and other objects) standing in particular temporal and spatial relations to each other.</Paragraph> <Paragraph position="1"> Moreover, the agent 'looks' at the narrative from the perspective of a particular character, spatial location and temporal location. Thus the deictic center consists of a WHERE-point, a WHEN-point and a WHO point.&quot; (Bruder et al., 1986:1). In this paper, the WHEN-point of the deictic center is referred to as the temporal focus, or TF (cf. Webber 1987a,b; Nakhimovsky 1987b). This term is adopted because it brings out the similarity Computational Linguistics, Volume 14, Number 2, June 1988 37 Alexander Nakhimovsky Aspect, Aspectual Class, and the Temporal Structure of Narrative between definite and temporal anaphora (cf. Webber, this volume, for the genesis of this idea.).</Paragraph> <Paragraph position="2"> Not one but two structures are thus needed to represent a narrative: the event-situation structure (ESS, cf. Webber 1987b), representing the narrative's unfolding contents, and the linear text structure (LTS), whose components are linked by rhetorical relations such as elaboration, resumption, or flashback (see, e.g., Hobbs 1982). Next to ESS and LTS, there must also be a short-term memory structure called current focus space (CFS) and a buffer in which the representation of the current sentence is assembled. Among the components of the CFS are the data structures needed for comprehension of definite anaphora (such as the Focus Stack and Alternate Focus List in Sidner 1983), as well as the components of the current deictic center, which are really pointers to the appropriate nodes of the ESS.</Paragraph> <Paragraph position="3"> After the first sentence of a narrative establishes the initial set-up, each subsequent sentence is processed in the context consisting of the ESS and the CFS. Depending on the sentence's meaning, one of two things happen: either the representation of the sentence is incorporated in the CFS, with the focusing mechanisms appropriately modified; or, in the case of a focus shift, the contents of the CFS are incorporated into the ESS and LTS, and the CFS is completely reset. In terms of text structure, the current sentence either continues the same, or starts a new, discourse segment (DS; cf. Grosz and Sidner 1986).</Paragraph> <Paragraph position="4"> The nature of the processing at the DS juncture is thus quite different from the &quot;routine&quot; tasks to be performed as long as the text remains in the same DS: the start of new DS prompts, and is prompted by, a shift of attention. The circularity here is deliberate. The start of a new DS brings about several changes, some of them more immediately noticeable than others; the more obvious ones serve to signal that a new DS is, indeed, started. Some of the most important such signals come from changes in time scale, aspectual class, and other temporal characteristics of narrated h-tokens. These temporal discontinuities are usually accompanied by discontinuities of other kinds that form recurrent patterns; I suspect, and this is an empirical hypothesis, that rhetorical relations within a narrative text are macrolabels that stand for certain oft-repeated clusters of discontinuities. In any event, the task of segmenting narratives into units that form the basic elements of discourse representations and constrain the application of focusing algorithms is obviously very important. The task is frequently mentioned in work on discourse (see Grosz and Sidner 1986:177-8 and references there), but the available heuristics mostly rely on clue words and phrases. I propose heuristics based on four kinds of discontinuities: discontinuities of topic, discontinuities of space and time, discontinuities of figure and ground, and discontinuities of the narrative perspective (not to be confused with aspectual sentence perspective). I explain these after a preliminary and partial illustration.</Paragraph> <Paragraph position="5"> Consider' Example 6.</Paragraph> <Paragraph position="6"> Example 6 a. Hartley and Phoebe had been sent by their mother to fix the tail vane of the windmill.</Paragraph> <Paragraph position="7"> b. In the great expanse of the prairie where they lived, the high tower of the windmill was the only real landmark. (Worline 1956:1) Rhetorically speaking, the (b) sentence above interrupts a sequence of events (DS1) to start DS2, a description.</Paragraph> <Paragraph position="8"> In order to recognize this rhetorical relation between the two DSs it is necessary to recognize that: a. there is a shift of topic; b. there is a shift in perceptual modality to visual perception; c. there is a shift in time scale from the events of the current day to years or decades, associated with the lifetime of a windmill and the 'where they lived' clause; d. there is a shift in spatial scale from a household to the entire prairie; e. there is a shift from a foregrounded sequence of events to the 'mopping-up' operation of filling the background. The shift is signaled by changes in aspectual class.</Paragraph> <Paragraph position="9"> The following empirical investigation thus suggests itself: classify the discontinuities and clusters of discontinuities that typically accompany DS breaks in narratives; identify the linguistic and extralinguistic knowledge involved; develop heuristics for using this knowledge; and test the heuristics in a computer program. 6 This paper presents a classification that arises from an analysis of what constitutes a narrative.</Paragraph> <Paragraph position="10"> The first observation to make is that a narrative must have a \]plot, i.e., present a sequence of events that forms an instance of a recognizable pattern. (The patterns are part universal, part culture specific; the work in Lehnert 1982 can be seen as a search for the principles on which such patterns are built.) Using the Gestalt terminology (brought into linguistics by Talmy 1983) I can say that a narrative's plot must present a recognizable temporal/ causal figure shown against some ground that minimally consists of spatial/visual settings (descriptions of characters are also frequent). The distinction is not always clear-cut because elements of the figure can be hidden among the details of the ground, but the temporal nature of the plot does stand in clear contrast to the spatial nature of the ground.</Paragraph> <Paragraph position="11"> Secondly, a narrative must have characters with which we empathize. These characters don't have to be human: one can easily imagine a story about an adventuresome plant seed that falls off its parent, gets swallowed and excreted by a horse, and nearly drowns in a tropical rain before being miraculously saved by the sun and producing a flower. Even so, the narrative is likely to alternate between the objective narrator's point of 38 Computational Linguistics, Volume 14, Number 2, June 1988 Alexander Nakhimovsky Aspect, Aspectual Class, and the Temporal Structure of Narrative view and that of one of the characters. (&quot;The belly of the horse was dark and noisy inside.&quot;) Some discontinuities of narratives very simply reflect the intrinsic, spatial, or temporal discontinuities of the plot, when, e.g., the story is composed of a sequence of events taking place during an afternoon, followed by two years' hiatus, followed by another action-packed afternoon. (A similar example of a spatial discontinuity can be easily imagined.) One measure of the &quot;simplicity&quot; of a narrative is how faithfully the order and structuring of its text reflects the order and structuring of its component events. Even in the simplest narrative, however, there are bound to be discontinuities resulting from the tension between the linear nature of the text and the multidimensional structure that it is meant to evoke.</Paragraph> <Paragraph position="12"> These are discontinuities of figure and ground, when the narrative shifts between the main story line and the surrounding circumstance, and discontinuities of perspective, when the narrative crosses into a different &quot;empathy space&quot; or creates a new one.</Paragraph> <Paragraph position="13"> Given this classification of discontinuities one can proceed to catalogue the clues that signal them. This is a subject for a large empirical study, of which the next section is but a preliminary sketch. It is important to keep in mind that it is clusters of discontinuities that signal the beginning of a new DS.</Paragraph> </Section> <Section position="9" start_page="0" end_page="0" type="metho"> <SectionTitle> 6 DISCONTINUITIES IN NARRATIVES 6.1 DISCONTINUITIES OF TOPIC </SectionTitle> <Paragraph position="0"> Discontinuities of topic fall in two groups. In the first, there is no anaphoric relation or immediate inference path from the new topic to a node in the CFS. What &quot;an immediate inference path&quot; is depends, of course, on the system's knowledge base and inferential capabilities, but this is a separate big issue that is not dealt with in this paper. The second kind of discontinuity, noted in Reichman 1985, arises when an anaphoric relation exists and allows a pronominal anaphor, but the topic is instead reintroduced by a full noun phrase. I call this phenomenon topic reintroduction and illustrate it in 6.5.</Paragraph> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 6.2 TEMPORAL DISCONTINUITIES </SectionTitle> <Paragraph position="0"> The most important temporal discontinuities are: (a) A shift from perfective to imperfective sentence perspective accompanied by a shift to a much greater time scale. The corresponding rhetorical move is frequently characterized as &quot;introducing background or descriptive material&quot;. The move is frequently accompanied by a topic reintroduction. null (b) The reverse shift from descriptive material to the main line of the narrative. This move is signaled by the temporal focus (TF), and the entire deictic center, returning to an established node in the ESS, with an appropriate contraction of the time scale.</Paragraph> <Paragraph position="1"> (c) A backwards move of the TF to an earlier point in time, with or without a change in time scale. Rhetorically, this is known as flashback. This move is frequently signaled by a verb in past perfect or by the used to + infinitive construction, although a shift may occur without such a verb form, and the presence of such a verb form does not necessarily signal a shift: the reference time of the sentence may remain the same as, rather than precede, the current TF. (Section 7 illustrates; see also Almeida 1987 and Webber 1987b, this volume.)</Paragraph> </Section> <Section position="2" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 6.3 SPATIAL DISCONTINUITIES </SectionTitle> <Paragraph position="0"> The only purely spatial discontinuities I have looked at are discontinuities of scale. Just as h-types have time scales associated with them, characterized in terms of &quot;received&quot; cyclical events such as day or year, it seems equally necessary to establish a gradation of spatial scales, based on similar considerations from human biology and habitat. (A hint at the same idea can be found in Nirenburg et al., 1985:230.) Some important spatial scales are: tiny, hand, one's body (may need to be further specialized); within arm's reach; room area, such as desk or bed; room; floor; house; household; village or neighborhood; larger area within a betweenmeals round trip; within a day's round trip; staying overnight. (The larger the scale, the more domain- and culture-specific variation there is.)</Paragraph> </Section> <Section position="3" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 6.4 DISCONTINUITIES OF PERSPECTIVE </SectionTitle> <Paragraph position="0"> Two kinds of phenomena are clearly relevant: attitude reports, and indications of the spatial position from which things are perceived (the WHO-point in Bruder et al., 1986). The subject needs further study.</Paragraph> </Section> <Section position="4" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 6.5 AN EXAMPLE </SectionTitle> <Paragraph position="0"> The remainder of this section shows how the above heuristics work themselves out on the initial fragment of Joyce's &quot;Clay&quot; (Joyce 1969:99-106). The fragment falls into three DSs: Sentence 1, sentences 2-6, and sentences 6-12; Sentence 13 starts a fourth DS by returning to the first one. Embedded in DS3 is DS3.1 consisting of sentences 9-10.</Paragraph> <Paragraph position="1"> 1. The matron had given her leave to go out as soon as the women's tea was over and Maria looked forward to her evening out. 2. The kitchen was spick and span: the cook said you could see yourself in the big copper boilers. 3. The fire was nice and bright and on one of the side-tables were four very big barmbracks. 4. These barmbracks seemed uncut; but if you went closer you would see that they had been cut into long thick even slices and were ready to be handed round at tea. 5. Maria had cut them herself.</Paragraph> <Paragraph position="2"> 6. Maria was a very, very small person indeed but she had a very long nose and a very long chin. 7. She talked a little through her nose, always soothingly: Yes, my dear, and No, my dear. 8. She was always sent for when the women quarrelled over their tubs Computational Linguistics, Volume 14, Number 2, June 1988 39 Alexander Nakhimovsky Aspect, Aspectual Class, and the Temporal Structure of Narrative and always succeeded in making peace. 9. One day the matron had said to her:--Maria, you are a veritable peace-maker! 10. And the sub-matron and two of the Board ladies had heard the compliment. 1 I.</Paragraph> <Paragraph position="3"> And Ginger Mooney was always saying what she wouldn't do to the dummy who had charge of the irons if it wasn't for Maria. 12. Everyone was so fond of Maria.</Paragraph> <Paragraph position="4"> 13. The women would have their tea at six o'clock and she would be able to get away before seven.</Paragraph> <Paragraph position="5"> The transition from Sentence 1 to 2 is very similar to the transition from (a) to (b) in Example 6: it is a shift from figure to ground marked primarily by aspectual changes and a shift from temporal to spatial/visual material. The change in time and space scales is not as dramatic here, but, on the other hand, there is a discontinuity of perspective as the story shifts from Maria's attitude report (&quot;Maria looked forward&quot;) to the objective perspective of the implied narrator (&quot;The kitchen was spick and span... &quot;).</Paragraph> <Paragraph position="6"> The transition from 5 to 6 (paragraph break) is characterized by a shift in time scale and a topic reintroduction. The material is a (back)ground character description: &quot;Maria is such that .... &quot; The beginning of DS3.1 is signaled by a well-known &quot;clue phrase&quot; One day and by the past perfect tense. Note that the material is still ground (&quot;Maria is such that .... &quot;) but the precise relationship between DS3 and DS3.1 is ambiguous: they can be siblings. Only when DS3 returns in Sentence 11 (the signals are the return to the simple past and the sentence-initial and) is it established that DS3.1 is, indeed, embedded in DS3.</Paragraph> <Paragraph position="7"> The transition from 12 to 13 (the next paragraph break) is a return to the event sequence of Sentence 1.</Paragraph> <Paragraph position="8"> The position of the TF, and the entire deictic center is indicated by the future-in-the-past tense. Note that it is essential to retrieve the entire deictic center, and not just the TF, because the WHO-point is also restored: the she in 13 does not evoke Maria of Sentence 12 or any other sentence in DS2 and 3 presented from the perspective of the implied narrator. Rather, this pronoun is a quasi-indexical (Rapaport 1986) that replaces the first-person singular I of a portion of the narrative from Maria's perspective, signaled by Maria looked forward in Sentence 1.</Paragraph> </Section> </Section> <Section position="10" start_page="0" end_page="0" type="metho"> <SectionTitle> 7 NARRATIVE MOVES </SectionTitle> <Paragraph position="0"> It is a common intuition that, as a narrative progresses, TF changes its position in time. I will refer to the movement of TF from one sentence of the narrative to the next as the narrative move. Narrative moves can be classified along several dimensions. First off, the TF can remain in the same timeline or branch off to an alternative one. (Such branchings-off are usually signaled by a nonindicative mood of the verb; this paper has nothing more to say about them.) Within the same timeline, the TF can move forward, sideways (into an overlapping h-token), or backwards, to an earlier htoken. In each of these three cases, the move may or may not be accompanied by a change in time scale.</Paragraph> <Paragraph position="1"> Finally, the TF can remain within the current focus space or shift to a different one. (In terms of linguistic structure, the current sentence may continue the same, or start a new, DS.) I will coin the terms mieromove and macromove for the last distinction.</Paragraph> <Paragraph position="2"> The preceding section argued that, if considered in the context of the entire deictic center movements, macromoves can be fully catalogued and associated with a limited set of rhetorical labels. It appears even more likely that micromoves also form a small set of recurrent patterns, because they are additionally constrained by the requirement that they cannot introduce a drastic discontinuity. I present a sample of micromoves below, contrasting them with macromoves; the exposition is organized by the temporal classification of narrative moves (backward, sideways, or forward).</Paragraph> <Paragraph position="3"> Much of the material of this section overlaps with Webber (this volume); some of the examples are from Dowty 1986.</Paragraph> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 7.1 BACKWARD MOVES </SectionTitle> <Paragraph position="0"> Backward moves are always macromoves, i.e., they always introduce a DS break. This is not as drastic a claim as it might seem, because simply narrating an h-token that precedes the current TF does not necessarily introduce a backward move. In particular, one sentence in the past perfect does not produce a backward move. (This point is made, with respect to the French plus-que-parfait, in Kamp and Rohrer 1983:256 fn.) If the out-of-sequence h-token is presented perfectively, i.e., the RT is after it, the TF does not move backward. Compare the following two examples:</Paragraph> </Section> <Section position="2" start_page="0" end_page="0" type="sub_section"> <SectionTitle> Example 7 </SectionTitle> <Paragraph position="0"> The telephone rang: it Was Mme Dupont; her husband had eaten too many oysters for lunch.</Paragraph> <Paragraph position="1"> The doctor recommended a change in lifestyle.</Paragraph> <Paragraph position="2"> Example 8 . . . Tessie sat beside \[Phoebe\] with Araminta. Araminta had begun as an ear of corn. On a piece of cloth Mother had painted a face, using bluing for the eyes... She had stitched on smaller pieces of cloth, black for the nose, pink for the checks and red for the mouth. She had tied the cloth over one end of the ear of corn, then made a - bonnet to go over that... All in all Araminta was one of the most elegant members of the Dawson family. (Worline 1956:28).</Paragraph> <Paragraph position="3"> In (7) the past perfect is like the present perfect in a dialogue: it indicates precedence and relevance but does not shift the TF. In (8) the past perfect is like a past tense in a dialogue: it creates a new TF position to which subsequent sentences refer, until simple past 40 Computational Linguistics, Volume 14, Number 2, June 1988 Alexander Nakhimovsky Aspect, Aspectual Class, and the Temporal Structure of Narrative terminates the embedded DS and returns to the one interrupted by the backwards move.</Paragraph> <Paragraph position="4"> A sentence in the simple past following a sentence in the past perfect does not always terminate an embedded DS: nontemporal anaphora may integrate the two sentences (cf. Webber, Example 20 in this issue, and discussion). Such a past perfect/simple past configuration is a must if one wants to go into a long, multi-FS digression, because there seems to be a strong limit on how long an embedded DS can continue in the past perfect, as in (8). Consider: Example 9 John was the captain of Penn's squash team. He had previously been captain of the Haverford team, but the coach there started chasing after John's girlfriend and John got very upset. One day he went to talk to the coach, but...</Paragraph> <Paragraph position="5"> Example 9 can now continue with an embedded narrative of arbitrary length. However, the interrupted DS has to be eventually resumed, otherwise the narrative is perceived as sloppy (or, if Proust does it, self-conscious, cf. Genette 1980:66-67).</Paragraph> </Section> <Section position="3" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 7.2 SIDEWAYS AND FORWARD MOVES </SectionTitle> <Paragraph position="0"> The majority of micromoves belong in these two categories, and in this brief subsection I cannot possibly do justice to them all. What I try to do is establish a framework, based on the conceptual apparatus of this paper, for a more detailed investigation.</Paragraph> <Paragraph position="1"> I assume, for simplicity, that both the most recently processed sentence (SO) and the current sentence (S1) consist of just one clause each, narrating h-tokens H0 and HI, respectively; the verbs in the sentences, V0 and V1, can be either past or progressive, and their aspectual classes can be any of those introduced in Section 3. Consider a few combinations.</Paragraph> <Paragraph position="2"> H0: telic, simple past-Hl: telic, simple past Example 10 a. John entered the president's office. The president got up.</Paragraph> <Paragraph position="3"> b. John blew his nose. The president mopped his brow.</Paragraph> <Paragraph position="4"> c. John hid in the president's office. Bob locked himself up in the closet.</Paragraph> <Paragraph position="5"> This combination is most frequently a sequence, as in (a), but not necessarily so. If H0 and H1 are not related by an obvious chain of causality the sequencing is not obvious, and if they both can be construed as responses to the same event, they are interpreted as overlapping, cf. (b) and (c).</Paragraph> <Paragraph position="6"> H0: telic, simple past-Hl: atelic or state, simple past H0: telic, simple past-Hl: non-instantaneous, progressive Example 11 a. John entered the president's office. The president was asleep.</Paragraph> <Paragraph position="7"> b. John entered the president's office. The president was writing a letter.</Paragraph> <Paragraph position="8"> In these examples, there is no compositional relation between H0 and HI, and H1 is interpreted as beginning before H1 and continuing after it. However, consider the following texts, in which H1 is part of the body, or the resulting stage, of H0: Example 12 John knelt at the edge of the stream and washed his hands and face. He washed slowly, feeling the welcome sensation of the icy water on his parched skin.</Paragraph> <Paragraph position="9"> Example 13 a. Bob turned off the light. It was dark around him. b. Bob heard a loud click. The door behind him was locked.</Paragraph> <Paragraph position="10"> c. Bob closed his eyes. He was asleep.</Paragraph> <Paragraph position="11"> In (13), the narrative moves forward, into a narrated h-token. The conditions for the move are that there is a direct inferential link from H0 to HI, and/or some sort of incompatibility between them. So, before Bob's eyes were closed they had to be open, which, for most of us, is incompatible with being asleep; on the other hand, one frequently closes one's eyes with the intention of going to sleep. 7 H0: state, atelic or progressive-HI: state, atelic or progressive Example 14 a. John was fixing an electrical appliance downstairs. Suddenly, the whole house was dark. b. The president was conducting the interview with Mary in a cold, official manner. Suddenly, to her utter amazement, he was offering her an important job.</Paragraph> <Paragraph position="12"> The contrast between (11) and (13) illustrates the following narrative principle: it is easy to move sideways into a narrated h-token, but it is difficult to move forward into one. (In other words, an imperfective sentence will not be interpreted as advancing the TF unless such an interpretation is forced by some form of incompatibility and causality.) It is especially difficult to move forward from inside one interval into another one, as in Example 14. In fact, such a move cannot be effected without the help of an adverbial that acknowledges the gap (try removing suddenly from the examples). null This concludes my discussion of narrative moves. The sampling given here is certainly not complete, but an extensive empirical investigation can, I believe, produce a full catalogue, organized by the configuration of tenses, aspectual classes, and time scales of the Computational Linguistics, Volume 14, Number 2, June 1988 41 Alexander Nakhimovsky Aspect, Aspectual Class, and the Temporal Structure of Narrative h-tokens involved. As the examples of this section show, no such configuration determines the narrative move uniquely, but they strongly constrain the choices.</Paragraph> </Section> </Section> class="xml-element"></Paper>