<?xml version="1.0" standalone="yes"?>
<Paper uid="P86-1004">
  <Title>RECOVERING IMPLICIT INFORMATION</Title>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
LIC RELEASE, DISTRIBUTION UNLIMITED.
2 Prolog UNDerstands Integrated Text
</SectionTitle>
    <Paragraph position="0"> work in natural language understanding for the Strategic Computing Battle Management Program.</Paragraph>
    <Paragraph position="1"> The PROTEUS/PUNDIT system will map Navy CASREPs (equipment casualty reports) into a database, which is accessed by an expert system to determine overall fleet readiness. PUNDIT has also been applied to the domain of computer maintenance reports, which is the application discussed here.</Paragraph>
    <Paragraph position="2"> The paper focuses on the interaction between the syntactic, semantic and pragmatic modules that is required for the task of making implicit information explicit. We have isolated two types of implicit entities: syntactic entities, which are missing syntactic constituents, and semantic entities, which are unfilled semantic roles. Some missing entities are optional and can be ignored. Syntax and semantics have to recognize the OBLIGATORY missing entities and then mark them so that reference resolution knows to find specific referents for those entities, thus making the implicit information explicit. Reference resolution fills the two types of entities with two different methods, both of which are also used for general noun phrase reference problems. Implicit syntactic entities, ELIDED CONSTITUENTS, are treated like pronouns, and implicit semantic entities, ESSENTIAL ROLES, are treated like definite noun phrases.</Paragraph>
    <Paragraph position="3"> The pragmatic module as currently implemented consists mainly of a reference resolution component, which is sufficient for the pragmatic issues described in this paper. We are in the process of adding a time module to handle time issues that have arisen during the analysis of the Navy CASREPs.</Paragraph>
  </Section>
  <Section position="5" start_page="0" end_page="10" type="metho">
    <SectionTitle>
2. The Syntactic Component
</SectionTitle>
    <Paragraph position="0"> The syntactic component has three parts: the grammar, a parsing mechanism to execute the grammar, and a lexicon. The grammar consists of context-free BNF definitions (currently numbering approximately 80) and associated restrictions (approximately 35). The restrictions enforce context-sensitive well-formedness constraints and, in some cases, apply optimization strategies to prevent unnecessary structure-building. Each of these three parts is described further below.</Paragraph>
    <Section position="1" start_page="10" end_page="10" type="sub_section">
      <SectionTitle>
2.1. Grammar Coverage
</SectionTitle>
      <Paragraph position="0"> The grammar covers declarative sentences, questions, and sentence fragments. The rules for fragments enable the grammar to parse the "telegraphic" style characteristic of message traffic, such as disk drive down and has select lock. The present grammar parses sentence adjuncts, conjunction, relative clauses, complex complement structures, and a wide variety of nominal structures, including compound nouns, nominalized verbs and embedded clauses.</Paragraph>
      <Paragraph position="1"> The syntax produces a detailed surface structure parse of each sentence (where "sentence" is understood to mean the string of words occurring between two periods, whether a full sentence or a fragment).</Paragraph>
      <Paragraph position="2"> This surface structure is converted into an "intermediate representation" which regularizes the syntactic parse. That is, it eliminates surface structure detail not required for the semantic tasks of enforcing selectional restrictions and developing the final representation of the information content of the sentence. An important part of regularization involves mapping fragment structures onto canonical verb-subject-object patterns, with missing elements flagged. For example, the tvo fragment consists of a tensed verb + object, as in Replaced spindle motor. Regularization of this fragment maps the tvo syntactic structure into a verb + subject + object structure: verb(replace),subject(X),object(Y). As shown here, verb becomes instantiated with the surface verb, e.g., replace, while the arguments of the subject and object terms are variables. The semantic information derived from the noun phrase object spindle motor becomes associated with Y.</Paragraph>
      <Paragraph position="3"> The absence of a surface subject constituent results in a lack of semantic information pertaining to X.</Paragraph>
      <Paragraph position="4"> This lack causes the semantic and pragmatic components to provide a semantic filler for the missing subject using general pragmatic principles and specific domain knowledge.</Paragraph>
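The regularization step described above can be pictured with a small Python sketch. It is only an illustration under stated assumptions: the `Slot` class and the `regularize_tvo` function are hypothetical names, and an unfilled slot stands in for the uninstantiated variable X of the paper's Prolog representation.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Slot:
    """One argument slot in the regularized clause (hypothetical structure)."""
    role: str                      # "subject" or "object"
    filler: Optional[str] = None   # semantic information from an overt constituent

    @property
    def is_elided(self) -> bool:
        # An unfilled slot plays the part of the uninstantiated variable X.
        return self.filler is None


def regularize_tvo(verb_root: str, object_np: str) -> dict:
    """Map a tensed-verb + object fragment onto a verb/subject/object pattern."""
    return {
        "verb": verb_root,
        "subject": Slot("subject"),             # no surface subject: left open
        "object": Slot("object", object_np),
    }


clause = regularize_tvo("replace", "spindle motor")
print(clause["verb"], clause["subject"].is_elided, clause["object"].filler)
```

The open subject slot is what later signals to semantics and pragmatics that a filler has to be found.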
    </Section>
    <Section position="2" start_page="10" end_page="10" type="sub_section">
      <SectionTitle>
2.2. Parsing
</SectionTitle>
      <Paragraph position="0"> The grammar uses the Restriction Grammar parsing framework [Hirschman1982, Hirschman1985], which is a logic grammar with facilities for writing and maintaining large grammars. Restriction Grammar is a descendant of Sager's string grammar [Sager1981]. It uses a top-down left-to-right parsing strategy, augmented by dynamic rule pruning for efficient parsing [Dowding1986]. In addition, it uses a meta-grammatical approach to generate definitions for a full range of coordinate conjunction structures [Hirschman1986].</Paragraph>
    </Section>
    <Section position="3" start_page="10" end_page="10" type="sub_section">
      <SectionTitle>
2.3. Lexical Processing
</SectionTitle>
      <Paragraph position="0"> The lexicon contains several thousand entries related to the particular subdomain of equipment maintenance. It is a modified version of the LSP lexicon with words classified as to part of speech and subcategorized in limited ways (e.g., verbs are subcategorized for their complement types). It also handles multi-word idioms, dates, times and part numbers. The lexicon can be expanded by means of an interactive lexical entry program.</Paragraph>
      <Paragraph position="1"> The lexical processor reduces morphological variants to a single root form which is stored with each entry. For example, the form has is transformed to the root form have in Has select lock. In addition, this facility is useful in handling abbreviations: the term awp is regularized to the multi-word expression waiting^for^part. This expression in turn is regularized to the root form wait^for^part, which takes as a direct object a particular part or part number, as in is awp 2155-6147.</Paragraph>
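A minimal Python sketch of this two-step normalization follows. The two tables are hypothetical stand-ins for the much larger lexicon; only the awp and has entries come from the text.

```python
# Hypothetical miniature tables; the real lexicon holds several thousand entries.
ABBREVIATIONS = {"awp": "waiting^for^part"}
ROOT_FORMS = {
    "has": "have",
    "had": "have",
    "replaced": "replace",
    "waiting^for^part": "wait^for^part",
}


def root_form(token: str) -> str:
    """Expand an abbreviation, then reduce the result to its root form."""
    word = token.lower()
    word = ABBREVIATIONS.get(word, word)
    return ROOT_FORMS.get(word, word)


print([root_form(t) for t in "Has select lock".split()])
print(root_form("awp"))
```

Words without an entry pass through unchanged, which is a simplification made for this sketch.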
      <Paragraph position="2"> Multi-word expressions, which are typical of jargon in specialized domains, are handled as single lexical items. This includes expressions such as disk drive or select lock, whose meaning within a particular domain is often not readily computed from its component parts. Handling such frozen expressions as "idioms" reduces parse times and the number of ambiguities. Another feature of the lexical processing is the ease with which special forms (such as part numbers or dates) can be handled. A special "forms grammar", written as a definite clause grammar [Pereira1980], can parse part numbers, as in awaiting part 21556147, or complex date and time expressions, as in disk drive up at 11/17-1236. During parsing, the forms grammar performs a well-formedness check on these expressions and assigns them their appropriate lexical category.</Paragraph>
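The well-formedness check on special forms can be approximated as follows. This sketch swaps in regular expressions for the definite clause grammar used by the system, and the token shapes (eight digits with an optional hyphen, month/day-hhmm) are assumptions inferred from the examples in the text.

```python
import re
from typing import Optional

# Assumed shapes, inferred from the examples 2155-6147 and 11/17-1236.
PART_NUMBER = re.compile(r"\d{4}-?\d{4}")
DATE_TIME = re.compile(r"(\d{1,2})/(\d{1,2})-(\d{2})(\d{2})")


def lexical_category(token: str) -> Optional[str]:
    """Return a lexical category for a special form, or None if it is ill-formed."""
    if PART_NUMBER.fullmatch(token):
        return "part_number"
    match = DATE_TIME.fullmatch(token)
    if match:
        month, day, hour, minute = (int(group) for group in match.groups())
        well_formed = (
            month in range(1, 13) and day in range(1, 32)
            and hour in range(24) and minute in range(60)
        )
        return "date_time" if well_formed else None
    return None


print(lexical_category("2155-6147"), lexical_category("11/17-1236"))
```

A token that has the right shape but an impossible time of day is rejected rather than categorized.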
    </Section>
  </Section>
  <Section position="6" start_page="10" end_page="13" type="metho">
    <SectionTitle>
3. Semantics
</SectionTitle>
    <Paragraph position="0"> There are two separate components that perform semantic analysis, NOUN PHRASE SEMANTICS and CLAUSE SEMANTICS. They are each called after parsing the relevant syntactic structure to test semantic well-formedness while producing partial semantic representations. Clause semantics is based on Inference Driven Semantic Analysis [Palmer1985], which decomposes verbs into component meanings and fills their semantic roles with syntactic constituents. A KNOWLEDGE BASE, the formalization of each domain into logical terms, SEMANTIC PREDICATES, is essential for the effective application of Inference Driven Semantic Analysis, and for the final production of a text representation. The result of the semantic analysis is a set of PARTIALLY instantiated semantic predicates which is similar to a frame representation. To produce this representation, the semantic components share access to a knowledge base, the DOMAIN MODEL, that contains generic descriptions of the domain elements corresponding to the lexical entries. The model includes a detailed representation of the types of assemblies that these elements can occur in. The semantic components are designed to work independently of the particular model, and rely on an interface to ensure a well-defined interaction with the domain model. The domain model, noun phrase semantics and clause semantics are all explained in more detail in the following three subsections.</Paragraph>
    <Section position="1" start_page="10" end_page="12" type="sub_section">
      <SectionTitle>
3.1. Domain Model
</SectionTitle>
      <Paragraph position="0"> The domain currently being modelled by SDC is the Maintenance Report domain. The texts being analyzed are actual maintenance reports as they are called into the Burroughs Telephone Tracking System by the field engineers and typed in by the telephone operator. These reports give information about the customer who has the problem, specific symptoms of the problem, any actions taken by the field engineer to try to correct the problem, and the success or failure of such actions. The goal of the text analysis is to automatically generate a data base of maintenance information that can be used to correlate customers to problems, problem types to machines, and so on.</Paragraph>
      <Paragraph position="1"> The first step in building a domain model for maintenance reports is to build a semantic net-like representation of the type of machine involved. The machine in the example text given below is the B4700. The possible parts of a B4700 and the associated properties of these parts can be represented by an isa hierarchy and a haspart hierarchy. These hierarchies are built using four basic predicates: system, isa, hasprop, haspart. For example, the system itself is indicated by system(b4700). The isa predicate associates TYPES with components, such as isa(spindle^motor,motor). Properties are associated with components using the hasprop relationship, and are inherited by anything of the same type. The main components of the system (cpu, power_supply, disk, printer, peripherals, etc.) are indicated by haspart relations, such as haspart(b4700,cpu), haspart(b4700,power_supply), haspart(b4700,disk), etc. These parts are themselves divided into subparts which are also indicated by haspart relations, such as haspart(power_supply,converter).</Paragraph>
      <Paragraph position="2"> This method of representation results in a general description of a computer system. Specific machines represent INSTANCES of this general representation. When a particular report is being processed, id relations are created by noun phrase semantics to associate the specific computer parts being mentioned with the part descriptions from the general machine representation. So a particular B4700 would be indicated by predicates such as these: id(b4700,system1), id(cpu,cpu1), id(power_supply,power_supply1), etc.</Paragraph>
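To make the four predicates concrete, here is a small Python sketch of such a domain model. The facts mirror the examples in the text, except for the `rotates` property, which is invented purely for illustration; the function names are hypothetical as well.

```python
# Facts mirroring the predicates named in the text.
ISA = {"spindle^motor": "motor"}                     # isa(spindle^motor,motor)
HASPART = {
    ("b4700", "cpu"), ("b4700", "power_supply"), ("b4700", "disk"),
    ("power_supply", "converter"),
}
HASPROP = {"motor": {"rotates"}}                      # invented property, illustration only
IDS = {"system1": "b4700", "cpu1": "cpu", "power_supply1": "power_supply"}


def properties(component: str) -> set:
    """Collect the properties of a component, inheriting along isa links."""
    found = set()
    current = component
    while current is not None:
        found |= HASPROP.get(current, set())
        current = ISA.get(current)
    return found


def part_of(part: str, whole: str) -> bool:
    """True if `part` is reachable from `whole` through haspart relations."""
    direct = {p for (w, p) in HASPART if w == whole}
    return part in direct or any(part_of(part, d) for d in direct)


print(properties("spindle^motor"), part_of("converter", "b4700"))
print(part_of(IDS["cpu1"], IDS["system1"]))
```

The `IDS` table maps instance labels back to the generic descriptions, so a question about a particular machine can be answered from the general model.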
      <Paragraph position="3"> 3.2. Noun phrase semantics Noun phrase semantics is called by the parser during the parse of a sentence, after each noun phrase has been parsed. It relies heavily on the domain model for both determining semantic well-formedness and building partial semantic representations of the noun phrases. For example, in the sentence field engineer replaced disk drive at 11/2/0800, the phrase disk drive at 11/2/0800 is a syntactically acceptable noun phrase (as in participants at the meeting). However, it is not semantically acceptable, in that at 11/2/0800 is intended to designate the time of the replacement, not a property of the disk drive. Noun phrase semantics will inform the parser that the noun phrase is not semantically acceptable, and the parser can then look for another parse. In order for this capability to be fully utilized, however, an extensive set of domain-specific rules about semantic acceptability is required. At present we have only the minimal set used for the development of the basic mechanism. For example, in the case described here, at 11/2/0800 is excluded as a modifier for disk drive by a rule that permits only the name of a location as the object of at in a prepositional phrase modifying a noun phrase.</Paragraph>
      <Paragraph position="4"> The second function of noun phrase semantics is to create a semantic representation of the noun phrase, which will later be operated on by reference resolution. For example, the semantics for the bad disk drive would be represented by the following Prolog clauses.</Paragraph>
      <Paragraph position="5"> [id(disk^drive,X), bad(X), def(X), full_np(X)], where def(X) and full_np(X) record that X was referred to with a full, definite noun phrase rather than a pronoun or indefinite noun phrase.</Paragraph>
      <Paragraph position="6"> 3.3. Clause semantics In order to produce the correct predicates and the correct instantiations, the verb is first decomposed into a semantic predicate representation appropriate for the domain. The arguments to the predicates constitute the SEMANTIC ROLES of the verb, which are similar to cases. There are domain specific criteria for selecting a range of semantic roles. In this domain the semantic roles include: agent, instrument, theme, object1, object2, symptom and mod. Semantic roles can be filled either by a syntactic constituent supplied by a mapping rule or by reference resolution, requiring close cooperation between semantics and reference resolution. Certain semantic roles are categorized as ESSENTIAL, so that pragmatics knows that they need to be filled if there is no syntactic constituent available. The default categorization is NON-ESSENTIAL, which does not require that the role be filled. Other semantic roles are categorized as NON-SPECIFIC or SPECIFIC depending on whether or not the verb requires a specific referent for that semantic role (see Section 4). The example given in Section 5 illustrates the use of both a non-specific semantic role and an essential semantic role. This section explains the decompositions of the verbs relevant to the example, and identifies the important semantic roles. The decomposition of have is very domain specific.</Paragraph>
      <Paragraph position="8"> It indicates that a particular symptom is associated with a particular object, as in "the disk drive has select lock." The object1 semantic role would be filled by the disk drive, the subject of the clause, and the symptom semantic role would be filled by select lock, the object of the clause. The time(Per) is always passed around, and is occasionally filled by a time adjunct, as in the disk drive had select lock at 0800.</Paragraph>
      <Paragraph position="9"> In addition to the mapping rules that are used to associate syntactic constituents with semantic roles, there are selection restrictions associated with each semantic role. The selection restrictions for have test whether or not the filler of the object1 role is allowed to have the type of symptom that fills the symptom role. For example, only disk drives have select locks.</Paragraph>
    </Section>
    <Section position="2" start_page="12" end_page="12" type="sub_section">
      <SectionTitle>
Mapping Rules
</SectionTitle>
      <Paragraph position="0"> The decomposition of replace is also a very domain specific decomposition that indicates that an agent can use an instrument to exchange two objects.</Paragraph>
      <Paragraph position="2"> The following mapping rule specifies that the agent can be indicated by the subject of the clause.</Paragraph>
      <Paragraph position="4"> The mapping rules make use of intuitions about syntactic cues for indicating semantic roles first embodied in the notion of case [Fillmore1968, Palmer1981]. Some of these cues are quite general, while other cues are very verb-specific. The mapping rules can take advantage of generalities like "SUBJECT to AGENT" syntactic cues while still preserving context sensitivities. This is accomplished by making the application of the mapping rules "situation-specific" through the use of PREDICATE ENVIRONMENTS. The previous rule is quite general and can be applied to every agent semantic role in this domain. This is indicated by the X on the right hand side of the "/" which refers to the predicate environment of the agent, i.e., anything. Other rules, such as "WITH-PP to OBJECT2," are much less general, and can only apply under a set of specific circumstances. The predicate environments for an object1 and object2 are specified more explicitly. An object1 can be the object of the sentence if it is contained in the semantic decomposition of a verb that includes an agent and belongs to the repair class of verbs. An object2 can be indicated by a with prepositional phrase if it is contained in the semantic decomposition of a replace verb:</Paragraph>
      <Paragraph position="6"/>
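The idea of mapping rules guarded by predicate environments can be sketched in Python as below. This is not the system's rule notation: the rule table, the `env` dictionary, and the assumption that replace belongs to the repair class of verbs are all hypothetical choices made for illustration.

```python
# Each rule: (syntactic cue, semantic role, predicate-environment test).
# `env` is a hypothetical description of the verb decomposition being filled.
MAPPING_RULES = [
    ("subject", "agent", lambda env: True),                       # any environment
    ("object", "object1",
     lambda env: "agent" in env["roles"] and env["verb_class"] == "repair"),
    ("with_pp", "object2", lambda env: env["verb"] == "replace"),
]


def map_constituents(constituents: dict, env: dict) -> dict:
    """Assign syntactic constituents to semantic roles wherever a rule applies."""
    filled = {}
    for cue, role, applies in MAPPING_RULES:
        if cue in constituents and role in env["roles"] and applies(env):
            filled[role] = constituents[cue]
    return filled


replace_env = {
    "verb": "replace",
    "verb_class": "repair",     # assumption: replace is in the repair class
    "roles": ["agent", "instrument", "object1", "object2"],
}
result = map_constituents(
    {"subject": "field engineer", "object": "spindle motor"}, replace_env
)
print(result)
```

Roles for which no constituent is available, such as object2 here, are simply left unfilled at this stage; whether they must be filled later depends on their classification as essential.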
    </Section>
    <Section position="3" start_page="12" end_page="13" type="sub_section">
      <SectionTitle>
Selection Restrictions
</SectionTitle>
      <Paragraph position="0"> The selection restriction on an agent is that it must be a field engineer, and an instrument must be a tool. The selection restrictions on the two objects are more complicated, since they must be machine parts, have the same type, and yet also be distinct objects. In addition, the first object must already be associated with something else in a haspart relationship; in other words, it must already be included in an existing assembly. The opposite must be true of the second object: it must not already be included in an assembly, so it must not be associated with anything else in a haspart relationship. There is also a pragmatic restriction associated with both objects that has not been associated with any of the semantic roles mentioned previously.</Paragraph>
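The restrictions on the two objects of replace amount to a conjunction of simple tests, sketched here in Python. The function name, the entity labels and the three lookup tables are hypothetical; only the conditions themselves come from the text.

```python
def replace_restrictions_hold(obj1, obj2, types, machine_part_types, installed):
    """Check the object1/object2 restrictions described for `replace`."""
    same_type = types[obj1] == types[obj2]
    return (
        types[obj1] in machine_part_types   # both must be machine parts
        and same_type                        # of the same type
        and obj1 != obj2                     # yet distinct entities
        and obj1 in installed                # object1 already in an assembly
        and obj2 not in installed            # object2 not yet in one
    )


# Hypothetical discourse state for illustration.
types = {"motor1": "spindle^motor", "motor2": "spindle^motor", "drive1": "disk^drive"}
machine_part_types = {"spindle^motor", "disk^drive"}
installed = {"motor1", "drive1"}    # entities that are the part in some haspart fact

print(replace_restrictions_hold("motor1", "motor2", types, machine_part_types, installed))
```

Swapping the two arguments fails the check, because the replacement part would then already belong to an assembly.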
      <Paragraph position="1"> Both object1 and object2 are essential semantic roles. Whether or not they are mentioned explicitly in the sentence, they must be filled, preferably by an entity that has already been mentioned, but if not that, then entities will be created to fill them [Palmer1983]. This is accomplished by making an explicit call to reference resolution to find referents for essential semantic roles, in the same way that reference resolution is called to find the referent of a noun phrase. This is not done for non-essential roles, such as the agent and the instrument in the same verb decomposition. If they are not mentioned they are simply left unfilled. The instrument is rarely mentioned, and the agent could easily be left out, as in The disk drive was replaced at 0800. 3 In other domains, the agent might be classified as obligatory, and then it would have to be filled in.</Paragraph>
      <Paragraph position="2"> There is another semantic role that has an important pragmatic restriction on it in this example, the object2 semantic role in wait^for^part (awp).</Paragraph>
      <Paragraph position="3"> idiomVerb(wait^for^part,time(Per)) &lt;- ordered(object1(O1),object2(O2),time(Per)) The semantics of wait^for^part indicates that a particular type of part has been ordered, and is expected to arrive. But it is not a specific entity that might have already been mentioned. It is a more abstract object, which is indicated by restricting it to being non-specific. This tells reference resolution that, although a syntactic constituent, preferably the object, can and should fill this semantic role and must be of type machine-part, it should not try to find a specific referent for it (see Section 4).</Paragraph>
      <Paragraph position="4"> The last verb representation that is needed for the example is the representation of be.</Paragraph>
      <Paragraph position="6"> In this domain be is used to associate predicate adjectives or nominals with an object, as in disk drive is up or spindle motor is bad. The representation merely indicates that a modifier is associated with a theme in an attribute relationship. Noun phrase semantics will eventually produce the same representation for the bad spindle motor, although it does not yet.</Paragraph>
      <Paragraph position="7"> 3 Note that an elided subject is handled quite differently, as in replaced disk drive. Then the missing subject is assumed to fill the agent role, and an appropriate referent is found by reference resolution.</Paragraph>
    </Section>
  </Section>
  <Section position="7" start_page="13" end_page="14" type="metho">
    <SectionTitle>
4. Reference Resolution
</SectionTitle>
    <Paragraph position="0"> Reference resolution is the component which keeps track of references to entities in the discourse. It creates labels for entities when they are first directly referred to, or when their existence is implied by the text, and recognizes subsequent references to them. Reference resolution is called from clause semantics when clause semantics is ready to instantiate a semantic role. It is also called from pragmatic restrictions when they specify a referent whose existence is entailed by the meaning of a verb.</Paragraph>
    <Paragraph position="1"> The system currently covers many cases of singular and plural noun phrases, pronouns, one-anaphora, nominalizations, and non-specific noun phrases; reference resolution also handles adjectives, prepositional phrases and possessive pronouns modifying noun phrases. Noun phrases with and without determiners are accepted. Dates, part numbers, and proper names are handled as special cases. Not yet handled are compound nouns, quantified noun phrases, conjoined noun phrases, relative clauses, and possessive nouns.</Paragraph>
    <Paragraph position="2"> The general reference resolution mechanism is described in detail in [Dahl1986]. In this paper the focus will be on the interaction between reference resolution and clause semantics. The next two sections will discuss how reference resolution is affected by the different types of semantic roles.</Paragraph>
    <Section position="1" start_page="13" end_page="14" type="sub_section">
      <SectionTitle>
4.1. Obligatory Constituents and Essential
Semantic Roles
</SectionTitle>
      <Paragraph position="0"> A slot for a syntactically obligatory constituent such as the subject appears in the intermediate representation whether or not a subject is overtly present in the sentence. It is possible to have such a slot because the absence of a subject is a syntactic fact, and is recognized by the parser. Clause semantics calls reference resolution for such an implicit constituent in the same way that it calls reference resolution for explicit constituents. Reference resolution treats elided noun phrases exactly as it treats pronouns, that is, by instantiating them to the first member of a list of potential pronominal referents, the FocusList. The general treatment of pronouns resembles that of [Sidner1979], although there are some important differences, which are discussed in detail in [Dahl1986]. The hypothesis that elided noun phrases can be treated in much the same way as pronouns is consistent with previous claims by [Gundel1980] and [Kameyama1985] that in languages which regularly allow zero-np's, the zero corresponds to the focus. If these claims are correct, it is not surprising that in a sublanguage that allows zero-np's, the zero should also correspond to the focus.</Paragraph>
      <Paragraph position="1">  After control returns to clause semantics from reference resolution, semantics checks the selectional restrictions for that referent in that semantic role of that verb. If the selectional restrictions fail, backtracking into reference resolution occurs, and the next candidate on the FocusList is instantiated as the referent. This procedure continues until a referent satisfying the selectional restrictions is found. For example, in Disk drive is down. Has select lock, the system instantiates the disk drive, which at this point is the first member of the FocusList, as the object1 of have:</Paragraph>
      <Paragraph position="3"> Essential roles might also not be expressed in the sentence, but their absence cannot be recognized by the parser, since they can be expressed by syntactically optional constituents. For example, in the field engineer replaced the motor, the new replacement motor is not mentioned, although in this domain it is classified as semantically essential. With verbs like replace, the type of the replacement, motor in this case, is known because it has to be the same type as the replaced object. Reference resolution for these roles is called by pragmatic rules which apply when there is no overt syntactic constituent to fill a semantic role. Reference resolution treats these referents as if they were full noun phrases without determiners. That is, it searches through the context for a previously mentioned entity of the appropriate type, and if it doesn't find one, it creates a new discourse entity. The motivation for treating these as full noun phrases is simply that there is no reason to expect them to be in focus, as there is for elided noun phrases.</Paragraph>
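The search-then-create behaviour for essential roles can be sketched in Python as follows. The `Discourse` class, its method names and the `exclude` parameter (used here to keep the replacement distinct from the replaced part) are assumptions of this sketch, not the system's interface.

```python
from itertools import count


class Discourse:
    """Toy discourse context: entity label -> type, in order of mention."""

    def __init__(self):
        self.entities = {}
        self._counter = count(1)

    def new_entity(self, entity_type: str) -> str:
        label = f"{entity_type}{next(self._counter)}"
        self.entities[label] = entity_type
        return label

    def resolve_essential(self, entity_type: str, exclude=()) -> str:
        """Reuse a previously mentioned entity of the right type, else create one."""
        for label, known_type in self.entities.items():
            if known_type == entity_type and label not in exclude:
                return label
        return self.new_entity(entity_type)


context = Discourse()
replaced = context.new_entity("motor")                        # the motor that was mentioned
replacement = context.resolve_essential("motor", exclude={replaced})
print(replaced, replacement)
```

Because the only known motor is excluded, a new discourse entity is created for the unmentioned replacement; without the exclusion the existing entity would be reused.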
      <Paragraph position="4"> 4.2. Noun Phrases in Non-Specific Contexts Indefinite noun phrases in contexts like the field engineer ordered a disk drive are generally associated with two readings. In the specific reading the disk drive ordered is a particular disk drive, say, the one sitting on a certain shelf in the warehouse. In the non-specific reading, which is more likely in this sentence, no particular disk drive is meant; any disk drive of the appropriate type will do. Handling noun phrases in these contexts requires careful integration of the interaction between semantics and reference resolution, because semantics knows about the verbs that create non-specific contexts, and reference resolution knows what to do with noun phrases in these contexts. For these verbs a constraint is associated with the semantics rule for the semantic role object2 which states that the filler for the object2 must be non-specific. 4 This constraint is passed to reference resolution, which represents a non-specific noun phrase as having a variable in the place of the pointer, for example, id(motor,X).</Paragraph>
      <Paragraph position="5"> Non-specific semantic roles can be illustrated using the object2 semantic role in wait^for^part (awp). The part that is being awaited is non-specific, i.e., can be any part of the appropriate type. This tells reference resolution not to find a specific referent, so the referent argument of the id relationship is left as an uninstantiated variable. The analysis of fe is awp spindle motor would fill the object1 semantic role with fe1 from id(fe,fe1), and the object2 semantic role with X from id(spindle^motor,X), as in ordered(object1(fe1),object2(X)). If the spindle motor is referred to later on in a relationship where it must become specific, then reference resolution can instantiate the variable with an appropriate referent such as spindle^motor3 (see Section 5.6).</Paragraph>
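Python has no logic variables, so the following sketch imitates the uninstantiated referent with a small `Var` class. The class and its `bind` method are hypothetical; they only illustrate how one shared, initially unbound slot can appear in both the id relation and the ordered predicate and be filled in later.

```python
class Var:
    """A referent slot that may stay unbound, like an uninstantiated Prolog variable."""

    def __init__(self):
        self.value = None

    def bind(self, referent: str) -> None:
        if self.value is not None and self.value != referent:
            raise ValueError("already bound to a different referent")
        self.value = referent

    def __repr__(self):
        return self.value if self.value is not None else "_"


x = Var()
ids = [("fe", "fe1"), ("spindle^motor", x)]       # id(fe,fe1), id(spindle^motor,X)
ordered = {"object1": "fe1", "object2": x}         # ordered(object1(fe1),object2(X))
before = repr(ordered["object2"])
x.bind("spindle^motor3")                           # a later mention forces a specific referent
after = repr(ordered["object2"])
print(before, after)
```

Binding the variable once updates every structure that shares it, which is the effect the text describes for a non-specific part that later becomes specific.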
    </Section>
  </Section>
  <Section position="8" start_page="14" end_page="17" type="metho">
    <SectionTitle>
5. Sample Text: A sentence-by-sentence
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="14" end_page="17" type="sub_section">
      <SectionTitle>
analysis
</SectionTitle>
      <Paragraph position="0"> The sample text given below is a slightly emended version of a maintenance report. The parenthetical phrases have been inserted. The following summary of an interactive session with PUNDIT illustrates the mechanisms by which the syntactic, semantic and pragmatic components interact to  produce a representation of the text.</Paragraph>
      <Paragraph position="1"> 1. disk drive (was) down (at) 11/16-2305.</Paragraph>
      <Paragraph position="2"> 2. (has) select lock.</Paragraph>
      <Paragraph position="3"> 3. spindle motor is bad.</Paragraph>
      <Paragraph position="4"> 4. (is) awp spindle motor.</Paragraph>
      <Paragraph position="5"> 5. (disk drive was) up (at) 11/17-1236.</Paragraph>
      <Paragraph position="6"> 6. replaced spindle motor.</Paragraph>
      <Paragraph position="7"> 5.1. Sentence 1: Disk drive was down at 11/16-2305.</Paragraph>
      <Paragraph position="8"> As explained in Section 3.2 above, the noun phrase disk drive leads to the creation of an id of the form: id(disk^drive,[drive1]). Because dates and names generally refer to unique entities rather than to exemplars of a general type, their ids do not contain a type argument: date([11/16 The interpretation of the first sentence of the report depends on the semantic rules for the predicate be. The rules for this predicate specify three semantic roles: a theme, to whom or which is attributed a modifier, and the time. After a mapping rule in the semantic component of the system instantiates the theme semantic role with the sentence subject, disk drive, the reference resolution component attempts to identify this referent. Because disk drive is in the first sentence of the discourse, no prior references to this entity can be found. Further, this entity is not presupposed by any prior linguistic expressions. However, in the maintenance domain, when a disk drive is referred to it can be assumed to be part of a B4700 computer system. As the system tries to resolve the reference of the noun phrase disk drive by looking for previously mentioned disk drives, it finds that the mention of a disk drive presupposes the existence of a system. Since no system has been referred to, a pointer to a system is created at the same time that a pointer to the disk drive is created. Both entities are now available for future reference. In like fashion, the propositional content of a complete sentence is also made available for future reference. The entities corresponding to propositions are given event labels; thus event1 is the pointer to the first proposition. The newly created disk drive, system and event entities now appear in the discourse information in the form of a list along with the date.</Paragraph>
      <Paragraph position="10"> Note, however, that only those entities which have been explicitly mentioned appear in the FocusList: FocusList: [[event1], [drive1], [11/16-2305]] The propositional entity appears at the head of the focus list, followed by the entities mentioned in full noun phrases. 5 In addition to the representation of the new event, the pragmatic information about the developing discourse now includes information about part-whole relationships, namely that drive1 is a part which is contained in system1.</Paragraph>
      <Paragraph position="11"> Part-Whole Relationships: haspart([system1],[drive1]) The complete representation of event1, appearing in the event list in the form shown below, indicates that at the time given in the prepositional phrase at 11/16-2305 there is a state of affairs denoted as event1 in which a particular disk drive, i.e., drive1, can be described as down.</Paragraph>
      <Paragraph position="13"> 5.2. Sentence 2: Has select lock.</Paragraph>
      <Paragraph position="14"> The second sentence of the input text is a sentence fragment and is recognized as such by the parser. Currently, the only type of fragment which can be parsed can have a missing subject but must have a complete verb phrase. Before semantic analysis, the output of the parse contains, among other things, the following constituent list: \[subj(\[X\]),obj(\[Y\])\]. That is, the syntactic component represents the arguments of the verb as variables. The fact that there was no overt subject can be recognized by the absence of semantic information associated with X, as discussed in Section 3.2. The semantics for the maintenance domain sublanguage specifies that the thematic role instantiated by the direct object of the verb to have must be a symptom of the entity referred to by the subject. Reference resolution treats an empty subject much like a pronominal reference, that is, it proposes the first element in the FocusList as a possible referent.</Paragraph>
      <Paragraph position="15"> The first proposed referent, event1, is rejected by the semantic selectional constraints associated with the verb have, which, for this domain, require the role mapped onto the subject to be classified as a machine part and the role mapped onto the direct object to be classified as a symptom. Since the next item in the FocusList, drive1, is a machine part, it passes the selectional constraint and becomes matched with the empty subject of has select lock.</Paragraph>
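The treatment of the empty subject can be sketched as a walk down the focus list that stops at the first candidate satisfying the selectional constraint. The following Python fragment is an illustration under stated assumptions, not the system's Prolog code: the table SEMANTIC_CLASS, the class names, and the function resolve_elided are hypothetical.

```python
# Hypothetical semantic classes for the entities known after sentence 1.
SEMANTIC_CLASS = {
    "event1": "event",
    "drive1": "machine_part",
    "11/16-2305": "date",
}


def resolve_elided(focus_list, required_class, classes):
    """Propose focus-list entries in order; return the first that fits."""
    for candidate in focus_list:
        if classes.get(candidate) == required_class:
            return candidate
    return None  # no acceptable referent in focus


focus_list = ["event1", "drive1", "11/16-2305"]
# The subject of "have" must be a machine part in this domain.
subject = resolve_elided(focus_list, "machine_part", SEMANTIC_CLASS)
```

Here event1 is proposed first and rejected, and drive1 is accepted, matching the outcome described for has select lock.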
      <Paragraph position="16"> Since no select lock has been mentioned previously, the system creates one. For the sentence as a whole then, two entities are newly created: the select lock (\[lock1\]) and the new propositional event (\[event2\]): id(event, \[event2\]), id(select^lock,\[lock1\]). The following representation is added to the event list, and the FocusList and Ids are updated appropriately. 6 \[event2\] have(time(time1)) symptom(object1(\[drive1\]), symptom(\[lock1\]),time(time1)) 5 The order in which full noun phrase mentions are added to the FocusList depends on their syntactic function and linear order. For full noun phrases, direct object mentions precede subject mentions followed by all other mentions given in the order in which they occur in the sentence. See \[Dahl1986\], for details.</Paragraph>
      <Paragraph position="17"> 6 This version only deals with explicit mentions of time, so for this sentence the time argument is filled in with a gensym that stands for an unknown time period. The current version of PUNDIT uses verb tense and verb semantics to derive implicit time arguments.</Paragraph>
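The ordering rule stated in footnote 5 can be written down as a small Python sketch. The function name focus_order and the function tags "subj", "obj" and "pp" are assumptions made for illustration; how the entries of a new sentence are merged with those of earlier sentences is not covered here, since the text above does not specify it.

```python
def focus_order(event, mentions):
    """Order one sentence's entities for the focus list.

    mentions is a list of (label, function) pairs for full noun phrases,
    given in the order in which they occur in the sentence.
    """
    objects = [label for label, function in mentions if function == "obj"]
    subjects = [label for label, function in mentions if function == "subj"]
    others = [label for label, function in mentions
              if function not in ("obj", "subj")]
    # The propositional entity comes first, then direct objects,
    # then subjects, then everything else in sentence order.
    return [event] + objects + subjects + others


sentence1 = focus_order("event1", [("drive1", "subj"), ("11/16-2305", "pp")])
sentence2 = focus_order("event2", [("lock1", "obj")])
```

For sentence 1 this reproduces the FocusList given earlier. For sentence 2 the elided subject contributes nothing, because it is not a full noun phrase mention.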
      <Paragraph position="18">  5.3. Sentence 3: Motor is bad.</Paragraph>
      <Paragraph position="19"> In the third sentence of the sample text, a new entity is mentioned, motor. Like disk drive from sentence 1, motor is a dependent entity. However, the entity it presupposes is not a computer system, but rather, a disk drive. The newly mentioned motor becomes associated with the previously mentioned disk drive.</Paragraph>
      <Paragraph position="20"> After processing this sentence, the new entity motor1 is added to the FocusList along with the new proposition event3. Now the discourse information about part-whole relationships contains information about both dependent entities, namely that motor1 is a part of drive1 and that drive1 is a part of system1.</Paragraph>
      <Paragraph position="21"> haspart(\[drive1\], \[motor1\]) haspart(\[system1\], \[drive1\]) 5.4. Sentence 4: is awp spindle motor.</Paragraph>
      <Paragraph position="22"> Awp is an abbreviation for an idiom specific to this domain, awaiting part. It has two semantic roles, one of which maps to the sentence subject.</Paragraph>
      <Paragraph position="23"> The second maps to the direct object, which in this case is the non-specific spindle motor as explained in Section 4.2. The selectional restriction that the first semantic role of awp be an engineer causes the reference resolution component to create a new engineer entity because no engineer has been mentioned previously. After processing this sentence, the list of available entities has been incremented by three: id(event, \[event4\]) id(part,\[ 2317\]) id(field ^ engineer, \[engineer1\]) The new event is represented as follows:  the disk drive is presumed to be the same drive referred to previously, that is, drive1. The semantic analysis of sentence 5 is very similar to that of sentence 1. As shown in the following event representation, the predicate expressed by the modifier up is attributed to the theme drive1 at the specified time.</Paragraph>
      <Paragraph position="24">  The sixth sentence is another fragment consisting of a verb phrase with no subject. As before, reference resolution tries to find a referent in the current FocusList which is a semantically acceptable subject given the thematic structure of the verb and the domain-specific selectional restrictions associated with it. The thematic structure of the verb replace includes an agent role to be mapped onto the sentence subject. The only agent in the maintenance domain is a field engineer. Reference resolution finds the previously mentioned engineer created for awp spindle motor, \[engineer1\]. It does not find an instrument, but since the instrument is not an essential role, this is not a problem: the role is simply filled with another gensym that stands for an unknown filler, unknown1.</Paragraph>
      <Paragraph position="25"> When looking for the referent of a spindle motor to fill the object1 role, it first finds the non-specific spindle motor also mentioned in the awp spindle motor sentence, and a specific referent is found for it. However, this fails the selection restrictions, since although it is a machine part, it is not already associated with an assembly, so backtracking occurs and the referent instantiation is undone. The next spindle motor on the FocusList is the one from spindle motor is bad, (\[motor1\]). This does pass the selection restrictions since it participates in a haspart relationship.</Paragraph>
      <Paragraph position="26"> The last semantic role to be filled is the object2 role. Now there is a restriction saying this role must be filled by a machine part of the same type as object1, which is not already included in an assembly, viz., the non-specific spindle motor. Reference resolution finds a new referent for it, which automatically instantiates the variable in the id term as well. The representation can be decomposed further into the two semantic predicates missing and included, which indicate the current status of the parts with respect to any existing assemblies.</Paragraph>
      <Paragraph position="27"> The haspart relationships are updated, with the old haspart relationship for \[motor1\] being removed, and a new haspart relationship for \[motor3\] being added. The final representation of the text will be passed through a filter so that it can be suitably modified for inclusion in a database.</Paragraph>
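The role filling for replace and the resulting haspart update can be approximated in Python as follows. This is a rough sketch rather than the backtracking Prolog search itself: the function names are hypothetical, the candidate order is assumed to follow the FocusList, and the label motor3 is used for the non-specific spindle motor from the outset, although in the text it receives a specific referent only during resolution.

```python
def pick_installed(candidates, haspart):
    """Fill object1: the first candidate already part of an assembly."""
    installed = {part for _whole, part in haspart}
    for candidate in candidates:
        if candidate in installed:
            return candidate
    return None


def pick_spare(candidates, haspart):
    """Fill object2: the first candidate not yet part of any assembly."""
    installed = {part for _whole, part in haspart}
    for candidate in candidates:
        if candidate not in installed:
            return candidate
    return None


def swap_part(haspart, old_part, new_part):
    """Replace old_part by new_part in every haspart pair that names it."""
    return [(whole, new_part if part == old_part else part)
            for whole, part in haspart]


haspart = [("system1", "drive1"), ("drive1", "motor1")]
motors = ["motor3", "motor1"]  # assumed order of proposal

object1 = pick_installed(motors, haspart)  # motor3 is skipped as uninstalled
object2 = pick_spare(motors, haspart)
updated = swap_part(haspart, object1, object2)
```

The skipped candidate in pick_installed stands in for the undone instantiation described above; the loop simply moves on to the next proposal instead of backtracking.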
    </Section>
  </Section>
</Paper>