File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/75/t75-2020_abstr.xml
Size: 9,868 bytes
Last Modified: 2025-10-06 13:45:45
<?xml version="1.0" standalone="yes"?> <Paper uid="T75-2020"> <Title>BAD-MOUTHING FRAMES</Title> <Section position="1" start_page="0" end_page="93" type="abstr"> <SectionTitle> BAD-MOUTHING FRAMES </SectionTitle> <Paragraph position="0"> It appears that many people in the AI/psycholinguistics community are like my old friend (in California) who said: &quot;How can I understand something unless I believe it for a while.&quot; This seems to me to indicate the role of &quot;paradigms&quot; such as &quot;frames&quot; in the study of thought. Since I do not, myself, work that way and also do not (despite years of the New York Review) function well as a critic of scientific developments, I will limit myself to three rather concrete sets of remarks. These concern vision, interactions with the world and net models in the context of &quot;frames&quot;. Much of the discussion of Minsky's frames paper is concerned with visual perception. There are a number of conclusions about visual perception and visual memory that result from the frames paradigm which appear to me to be wrong and even wrongheaded. The notion that we store a large number of separate views in purely symbolic form is one such. There is a good deal of evidence that people do use three-dimensional models (e.g. Shepard) and that they regularly integrate several views into a single visual model which seems to be the predominate one (e.g. stereopsis, motion parallax). There are other examples of this sort, some of which are clearly the scars of old battles against perceptions, analog mysticism and the like. None of these are beyond incorporation into a frames paradigm.</Paragraph> <Paragraph position="1"> That this is true results from and illuminates the main difficulty with the frames paradigm as a theory -- it seems to be extensible to include anything at all.</Paragraph> <Paragraph position="2"> Without going as far as Popper ( ); we can ask that a theory be at least conceptually refutable. We propose an anti-frames hypothesis below.</Paragraph> <Paragraph position="3"> Another serious problem with a frames approach to vision lies in the strong assumption of default values for slots. The assumption is that, when viewing a new scene, we are basically just verifying our frame picture of it. Here a little intellectual history is called for. Early efforts in machine perception (and much perceptual psychology) were concerned with visual processes which operated independent of context. We studied edge detectors, pattern classifiers and algorithms for partitioning general straight-line drawings.</Paragraph> <Paragraph position="4"> This not only proved difficult but offered no promise of extension to typical real-world scenes. There then came a concerted effort to overcome (or circumvent) perception problems by giving programs lots of domain knowledge. This has been carried to the extreme of visual perception without vision, viz. anything black and on a desk is a telephone. The frames paper seems to approach vision in this way. Once again, there is substantial evidence (and overwhelming intuition) that this is not what occurs. For example, people can</Paragraph> <Paragraph position="6"> understand totally unexpected images presented for quite short periods. A recent mm study by Potter indicates that verbal | descriptions (which could give rise to very many images) are almost as efficient as a preview for choosing a target in such a set. mm To return to the questions posed at R the beginning, one does not need to know exactly what a thing will look like to detect it in a I/3-second i glimpse. In fact, knowing the | exact appearance of a target was little better than knowing only its general meaning, which suggests tht a scene is processed rapidly to an abstract level of meaning before m intentional selection occurs (Potter 1975).</Paragraph> <Paragraph position="7"> i Well; even Minsky doesn't claim the g frames paper is right -- only that it should help us ask the right questions. In machine perception, it seems to be having a * significant impact which I consider | negative. Frames and the whole idea of vision as mostly problem solving seem to have subverted work on the understanding of a images. It has always been tempting to | avoid the hard detailed problems of real images for the higher realms of abstract problem solving. The frames paper has had the effect (it really has) of sanctioning this retreat. i My second comment is an extension of this last point -- the actual effects of the mm world on a &quot;frames&quot; process is much too | weak. This has been a major concern of mine lately. Quoting from a recent paper: | &quot;The wheelless student problem S models the real-life problem of buying a used automobile (in the United States). A typical In procedure is first to read g newspaper advertisements and bulletin boards to assess the situation generally. Then, at m relatively low cost, one can * telephone various purveyors of cars i and inquire about them. At some point, one must actually go to the effort of seeing and driving * certain of these. There are g professional diagnostic services which can be employed (at considerable cost) to further test the car. In each of these steps, * one must decide when to stop that m stage and go on to the next one.</Paragraph> <Paragraph position="8"> One does not, of course, proceed in strict order; there will normally * be alternatives at several | different levels of investigation.</Paragraph> <Paragraph position="9"> Notice that the &quot;plan&quot; itself is trivial: read, telephone, look, B drive, professionally test and buy. i It is the application of this plan to the world situation which is difficult. We believe that much intelligent activity is | characterized by complex</Paragraph> <Paragraph position="11"> applications of simple plans and this belief has led to concentrate on the closely related questions of plan elaboration and execution.&quot; The point to be made here is that the world does not hold still for us. Theories which propose a narrow channel between the mind and the world have considerable value, but we will have to move beyond them to achieve real explanatory or robotic power.</Paragraph> <Paragraph position="12"> It is the relatively static nature of the frames paradigm which makes me most uncomfortable. I consider the main substantive claim of the frames paradigm to be that knowledge is stored in relatively coherent chunks which change only slowly over time. The opposing hypothesis is the older &quot;net&quot; model which says that the collection of knowledge being brought to bear at a given instant is a rapidly changing function of the situation. Once again, we can blur the distinction, but then I don't know what the frames paradigm is.</Paragraph> <Paragraph position="13"> Let us consider this question of static versus dynamic clumping of knowledge. We can assume, in both cases, that knowledge can include procedures, contexts and any other clever things that we dream up (Bobrow and Winograd have dreamed up dozens). The question is whether the high bandwidth connections among knowledge primitives can be fairly static. There are important reasons for hoping the frames hypothesis is true for animals or could be employed by machines.</Paragraph> <Paragraph position="14"> The general advantages of partitioning complexity are well known (Simon, Alexander). For any current or projected computers there is a memory hierarchy which strongly favors coherent chunks of knowledge relatively loosely coupled with other chunks.</Paragraph> <Paragraph position="15"> Unfortunately, I see great difficulty with static frames of anywhere near the scale discussed by Minsky.</Paragraph> <Paragraph position="16"> Try to introspect as you slowly read the following sentence: &quot;Imagine yourself walking into a room; it is the master bedroom of a quiet Victorian house, in a slum of Bombay, which has just had a fire and been rebuilt in modern style, except for the master bedroom which is only half remodeled having its decorative panelling intact but badly visible because of the thick smoke.&quot; The sentence above causes several shifts and refinements of the image. The question is, of course, where are the frames. It is possible that there are a very large number of room frames embodying all the combinatorial possibilities hinted at above. Alternatively, there could be a single room frame that incorporated all these possibilities. Neither of these alternatives strikes me as plausible. What seems to happen is that we build our model dynamically as we process the sentence. The anti-frame hypothesis here is that the connections which are most important (heavily used, etc.) are not specifiable in advance.</Paragraph> <Paragraph position="17"> Let us return to the problem of buying a used car. I believe that what happens is roughly as follows. We construct a goal-oriented subsystem making use of our knowledge of cars, buying, our locale, friends who know about cars, etc. This subsystem seems to have great internal coherence and rely only slightly on our total world knowledge about e.g. buying.</Paragraph> <Paragraph position="18"> The used-car-buying subsystem probably gets drastically changed each time it is reinvoked with only a few general principles carried over from its last incarnation.</Paragraph> <Paragraph position="19"> While it is active, the subsystem effects much of our perception of the real world and of our thinking. We notice different things, seek different people, make different mental associations, etc. My mind boggles at trying to model a system of such flexibility, but the frames hypothesis (as I understand it) seems more likely to lead me astray than to help.</Paragraph> </Section> class="xml-element"></Paper>