File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/93/w93-0211_metho.xml
Size: 15,015 bytes
Last Modified: 2025-10-06 14:13:31
<?xml version="1.0" standalone="yes"?> <Paper uid="W93-0211"> <Title>Observations and Directions in Text Structure</Title> <Section position="2" start_page="0" end_page="40" type="metho"> <SectionTitle> 2 Text Structure Primitives: A Tool or Side-Effect? </SectionTitle> <Paragraph position="0"> However, there is even more to the information-hiding nature of McKeown's schemas than just intentional stl'll(:tHl'e, which primitive-lmsod text StL'uCttlL'illg a.l~l~roat:lms se.enl to suflk~r front in general. They all seem to hide certain &quot;l~sychologicld&quot; dyuaniics which ull(hJubtedly underlie ahnost all human linguistic interaction. For example, it is l)ossible that McKeown's schema for defining an object might be derivahle from more general principles of intbl'mation processing: Definitions tend to hegin with an identification to give tile hearer a default set: of attributes in one fell swoop, then follow that with a shorter list of specific attributes to override those inherited. But even this al)proach might not be best schematized, as the following text seems to serve quite well as a definition: Q: &quot;What's a kumquat?&quot; A: &quot;It's like a teeny little orange, but really sour.&quot; This represents yet another technique, not relying upon inherita.nce but on the hearer's worhl knowledge, to convey a great amount of attributive inibrmation with very few resources. As McKeown's schelnas were only arrived a,t through a. process of text. a.lialysis, we know nothing about exactly what lnight make a 1)articular schema.l.ize(I l~resent~l.t.iou of ill\['ortHation effective at any given tilne, or how to choose between schema.tic texts if more t.ha.n oite is possil)le. No doubt certain hearers might prefer many examples, or perhaps a pa.rticula.r concept lends itself&quot; very well to analogy, but at present we have no (leep understa.llding a.s to wha.t would iadicate that examples or analogies are usefid.</Paragraph> <Paragraph position="1"> Further, a reasonable analysis of an existing text can confiate, confuse or obscure the actual processes that went: into composing it with other f;tctors only secondary to the cohesiveness of the text. As Moore & Pollack point out in \[MP92\], there can be non-isomorphic relations between text spans on different dimensions; we add that it is therefore unclear which of these relations contrihute most to the coherence of tile text. It is possible that some relations are really si(le-effects of others. In Moore & Pollack's example, &quot;Come home by 5:00. Then we can go to the hardware store bel'~)re i1 close~. '|'hal w:~. we can finish the bookshelves l.cnfight.'&quot; there is a right-branching tree of Motivation relations, as well as a left- bra.uchiHg tree of ComlitiolL relations both structuring this text. However, the Motivation relations seem to be suflici(~nt to structure this text, for probably the most likely way to motivate someone to do an act is to show how some desirable state of affairs is conditional on that act (and in fa.ct, this is exactly one type of plan operator Moore & Swartout's genera,tion system would use). In this ca.se, the &quot;informa.tional&quot; relations seem directly del)endent on the intuitively more ceJltra.1 &quot;ln'(~seld.a, tional&quot; ones. This seems to suggest that the mere th.ct that relations al)l)ea.r in text after text in the ;~ua.lysis stage does not imply that those relations are the most useful tools to plan with.</Paragraph> <Paragraph position="2"> In fact, this last point seems to I)e an essential one to explore. Both McKeown and Ma,nn & Thompson found regularities in the texts they analyzed, and used these regularities to characterize coherent texts. But his characterization can only serve as a first a.1)l)roxinta.tion. Perhal)s they've shown that coherent texts have RST trees or fit certain schemas, but it's clearly not the case that schematized texts or ones with R,ST trees are necessarily well-tormed, even if they obey some additional constraints (e.g., focusing).</Paragraph> </Section> <Section position="3" start_page="40" end_page="41" type="metho"> <SectionTitle> 8 Choosing Appropriate Instantiations </SectionTitle> <Paragraph position="0"> In tim same way that no theory we know of can guide a search among possil)le rhetorical strategies (Analogy, Identification, etc) as to their effects on the hearer and apl)ropriateness, the content of individual relations and 1)redicates are underspecified to such a.u extent that given a rich enough knowledge base, a single well-formed rhetorical skeletoll could be incarnated with any amount of suboptimal or inapl)ropriate intbrmation making for an ineffective or incoherent text.</Paragraph> <Paragraph position="1"> To illustrate, imagine that one's task is to generate an explanation of the important social factors leading up to and resulting from the American C, ivil Wa.r. \]n describing the circumstances of the start of the war, there are many attitudes, events, a.nd characteristics of the mid-1800's we could choose from to achieve setting up the circumsta.nces. Among thelll, we know about the tensions between the industrialized North and the agricultural South, institutionalized racism, perhaps something M)out the attitudes of the people at the time towa.rds their country and their fifestyles, and many others. But a. rich knowledge base also contains a lot of il~formation less useful given our task; the opening up of the Orient to trade with the West, in stitu t ionalized (liscrinfination against women, a.nd perhal)s something al)out style of (h'ess or what a typical day was like on ;i.</Paragraph> <Paragraph position="2"> South Carolina plantation. All of these serve as providing circumstalltia,l information, and given a perfect text fi'om a history book which makes use of some circumstantial intbrmation along the lines of the first set above, it's quite likely one could find information equally circumstantial along the lines of the second set, focussing on the very sa.me notions (econonfics, prejudicial attitudes, or some aspect of human interest), 1)ut whose use would have iml)aired the logical flow of the text.</Paragraph> <Paragraph position="3"> Similarly, in giving Background information about Abrahant Lincoht, s(mm information such a,s his attitudes towards state's rights an(l secession, and perhal)s his legendary morality and honesty, nmst be sifted out of a rich knowletlg(~ base at the expense of other ill\[(1)l'lll;i.1;ioll a.I)out his mol(~, his beard, his childhood, or his clinical del)ressions, though a.ll are construed as Attributive relations which focus on Lincoln and in some sense give the rea.der Backgrouu(I iulbrma.tion about Lincoln.</Paragraph> <Paragraph position="4"> Such a problem seems to involve ca.lcula.ting relewl.nce, which h~.s not r(~ceived much attention as it relates to formal text structure d(wices like schemas or RST tree co\]istru(:tion. An interesting question, however, is how clearly releva.nce selection can be divorced fl'om formal structuring. There is no inherent irrelevance in any information contained in a knowledge I)ase. (riven dif\[k~rent gross organizations and aims of the text, like a tale of the (',ivil War as gothic rom;i.zlce, boo l) skirts and oppression of women might be mentioua.ble within the sa,me text as desc.ril)t,i(nis of Southern political rage, and in writing a humanistic 1)sychological treatise one caai expound at length on Lincoln's melancholy and similar attributes where very (litl~rent attributes were menti(med in the sociological text. It seems that all these difl'erent types of texts share the property that the skeleton of rhetorical relations are fleshed out with information which is in some way linked to a. I)a.sic I)a,ckl)one or kernel of iuformation which organizes the text - the &quot;gist&quot; of the text. If the gist of the text is to give a sequential historica.l account of a War, tiffs backl)one might consist of sequential Causes and Eil~cts, with events and l)layers introduced as they figure in events mentioned, and elaborated with information which is related not simply in one way to tile inmledia.te context, forming a tree, but l)erhal)S to several t)arts of tile l~ackl)one, forming a gra.l~h. Thus ill a. text which aims at a specifically sociological account of the war, hoo l) skirts and Lincohl's mole, though associated in the knowledge base to aspects of the Civil War, (Io not relate i~k any iHterestilhg way to other pieces of information in the gist of the tale and doil't warra.llt melLtioll, while his ~l.ttitudes as a, l~olitician not only characterize him as personal a.ttril)utes but also might serve ;is I~oinll,('rs to why he behaw~d as he did or wily events untblded as they did.</Paragraph> </Section> <Section position="4" start_page="41" end_page="42" type="metho"> <SectionTitle> 4 Where To Go From Here </SectionTitle> <Paragraph position="0"> The rhetorical relations and predicates l)roposed in the literature, then, are l)robal)ly best viewed as approximations to i)e analyzed at a (leeper level. It seems that the recurring patterns of relations and l)redicates that text ana.lysis revea.ls are best viewed a.s common linguistic, goa.1satisfaction techniques -. techniques for enha.ncing understanding, l)romisillg directions for fllture research in that field would then be a.long several lines: Discerning the uHderlyillg principles that make rhetoric~d techniques no usefifl, (liscerning the dynamic, situa.tiollal phenomena which license the use of these techniques, and discerning what sorts of iJlti)rmatioJ~ best ca.rries out the work of the techniques in any given text.</Paragraph> <Paragraph position="1"> For the first tack, we would need to consi(ler the effectiveness and efficiency of the various techniques. While the systems of Moore & Swartout \[Moo89, Msgl\] aa~d Maybury \[May92\] try to incorporate rhetorical techniques into an intentional, goal-orieHted framework, they \]lave not shown how one technique should be chosen above the other when 1)(~th axe l~ossil)le. Maybury does not address tlle issue at all. Moore ~% Swa,rtout list: selection;i,l heuristi('s which preti:r operators that lnake the least demands on the hea,rer, but this very tr;i,IlSi);i,l'(,llt ilSe O\[&quot; the user ll|odel ix not sufficient to make tile most effective ('hoices. For exa.ml)\]e, in (h,scrihinlg a. missing person 1,o a police officer, the literature suggests such varied options a.s ('(}liq}are/conti'ast, a.ttributive descriptions, analogic~d descriptions, description 1)y 1)arts and SUbl}arts, and so on. The most effective options in that situation are probal)ly the attril)utive (&quot;lie's 5'1 l&quot;, 1901bs, brown hair, gray eyes, pale&quot;) and compare/contrast (lescriptions (&quot;He's abotlt your build but a. little shorter&quot;), but to decide this requires some understanding of the priucil)les (l~erha.I)S l)sychological or cognitive) which underlie the identity of each of the techniques. Efficiez~cy coltsitlera.tions a.lso come into play.</Paragraph> <Paragraph position="2"> In the current scenario, given a choice between the two most efl'ective techniques, we might prefer a compare/contrast description technique if a good candida.te for COml}arison is available. The rationale might be that comparison to an entity tra.nsmits ahnost all of that entity's attributes with a single invocation, and the contrast allows quick overriding of a few salient attril)utes.</Paragraph> <Paragraph position="3"> The second tack, discerning the dynamics of communication which license the use of these predicates and relations, is intimately related to the first, for the communic~l,tiorl situation provides the impetus for sea,rching for a technique in the first place. The &quot;missing person&quot; scenario above creates several interacting needs and desires on the parts of the l)a.rticil)altts. The relation of rhetorical techniques to these goals needs to I)e more ffillly ibcusse(l on. Similarly, the situation estM)lishes a perspective on what is to 1)e sa.id; since the police will wa.nt to look for the missing person, his physical attributes are highlightetl by the situa.tiolL, givi~Jg the impulse and facility for creating an attributive description or a comparative descriptioa highlighti.g physical lba.tures. In an another example, fl'om Ma.nn & Tlmmpsoll, &quot;What if you're having to clean flOl)l)y drive heads too often? Ask lbr Sylkcom diskettes, with burnished Ectype coating a.nd dust-absorbing jacket liners...&quot; the overall relation seems to be one of Solutionhood. However, this relation in and of itself does not reflect the persuasive nature of the text. It does however reflect more general princil)les of what a reader desires fl'om a product (a solution), how to attract the interest of a. rea.tler (pose a question s/he will want an answer to), and so Oil. A cursory glance through magazine :l.dvertisements shows an unusually large numher of ads structured this wa.y, and with very good rea.son.</Paragraph> <Paragraph position="4"> And in following the third tack, determining appropriate context, it's useful to consider a, wider range of texts than seems to have been considered beibre. We chose here to think about historicM narratives to illustrate how a very rich knowledge base might conibu nd selection of relevant information, but at the same time suggest a richer model of cohereHce I)a.sed on more than just recursive, nested constituents. Typical domains in which text structure is studied axe task-oriented in nature or are based on knowledge hases rich in isa-links but litth, else. To get at a more fundamental understantling of what fa.ctors influence text structure, olw shonl(l look a.cmss m;uly different text types and seek points of similarity. For exa.mple, is there a conJwction between the way a well-drawn comparison 1)rovides an etficient classiiica.tion of a.n item in the hearer's COml~lex knowledge base anti the way a well-chosen sel, of ha.ckground inibrlua.tion lm)vi(les support and explanation for the complex information to come? .</Paragraph> <Paragraph position="5"> In the end, the inventory of techniques a genera, tion systenl dra.ws Up(:)ll will need to he indexal)le by these interpersonal dynamics. They will also need to I)e chara.cterizalfle in ternls which facilitate constructing effective, efficient text. These COl~sidera.titms seem to (h,m;i.nd a,n examina.tion of rhetorical structures in terms of the mental states of the conversa.tion pa.rtners, their perspectives and wants, and also perhal~s in terms of the mental structures of the conversa, tion pa.rtners, exploiting how they process analogies, sinfiles, and classilications. In sum, we need to disassemble the information compiled into rhetorical relations a.nd predicates.</Paragraph> </Section> class="xml-element"></Paper>