File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/78/t78-1031_metho.xml
Size: 19,277 bytes
Last Modified: 2025-10-06 14:11:12
<?xml version="1.0" standalone="yes"?> <Paper uid="T78-1031"> <Title>PATH-BASED AND NODE-BASED INFERENCE IN SEMANTIC NETWORKS</Title> <Section position="4" start_page="219" end_page="219" type="metho"> <SectionTitle> PROP @ ISA FIGURE I: ISA hierarchy based on that of Collins and Quillian 3. Node-Based Inference </SectionTitle> <Paragraph position="0"> Several semantic network systems incorporate methods of representing general rules in a semantic network version of predicate calculus. Among these systems are those of Shapiro \[14;15;17\], Kay \[7\], Hendrix \[6\], Schubert \[12\], and Fikes and Hendrix \[3\]. Figure 2 shows such a network deduction rule representing</Paragraph> <Paragraph position="2"> Figure 3 shows a rule for</Paragraph> <Paragraph position="4"> The network formalism employed is that of Shapiro \[15;17\]. These deduction rules employ pattern nodes (P1, P2, PS, P4, P5, P6, PY), each one of which represents a pattern of nodes that might occur in the network. We will therefore call this kind of inference rule a node-based inference rule. Pattern nodes are related to each other by rule nodes, each of which represent a propositional operator, or, equivalently, an inference mechanism. For example, R2 represents the rule that if an instance of P1 occurs in the network, an instance of R1 with the same substitution for z may be deduced. Quantification is represented in this notation by an arc-relation between a rule node and the variable nodes bound in the rule. For example, z is bound by a universal quantifier in R2 and y is bound by an existential quantifier in R1.</Paragraph> <Paragraph position="6"> To see how a node-based inference proceeds, consider the network of Figure 4 in conjunction with the rule of Figure 3, and say that we wish to decide if A SUPPORTS C. The network that would represent that A SUPPORTS C matches P7 with the variable binding \[z/A, r/SUPPORTS, a/C\]. P4 in the binding Jr/SUPPORTS\] is matched against the network and is found to successfully match M1. P5 \[z/A, r/SUPPORTS, y/y\] and P6\[y/y, r/SUPPORTS, z/C\] are then both matched against the network and each succeeds with a consistent binding of y to B. The rule thus succeeds and A SUPPORTS C is deduced.</Paragraph> <Paragraph position="7"> (Details of the bindings and the match routine are given in \[15\].) A SUPPORTS B; B SUPPORTS C and SUFPORIS C/ TRANSITIVE~ It should be noted that set inclusion was represented by an arc (\[SA) in Section 2, but set membership is being represented by a node (with a MEMBER, CLASS &quot;case frame&quot;) in this section. The nodal representation is required by node-based inference rules and is consistent with the notion that everything that the network &quot;knows&quot;, and every concept to which the network can refer is represented by a node.</Paragraph> </Section> <Section position="5" start_page="219" end_page="219" type="metho"> <SectionTitle> 4. Advantages of Node-Based Inference </SectionTitle> <Paragraph position="0"> The advantages of node-based inference stem from the generality of the syntax of node-based inference rules. Path-based rules are limited to binary relations, have a restricted quantification structure and require that an arc between two nodes be implied by a path between the same two nodes. Rule R2 of Figure 2 could not be written as a path-based rule, and, although the transitivity of SUPPORTS could be expressed by a ~ath-based rule (SUPPORTS / SUPPORTS+). the &quot;second order&quot; rule R4 of Figure 3 could not.</Paragraph> <Paragraph position="1"> Let us b~iefly consider how rule R4 is constructed, whether it really is or is not a second order rule, and why it could not be expressed as a path-based rule.</Paragraph> <Paragraph position="2"> Rule R4 supplies a rule for use with transitive relations. In order to assert that a relation is transitive (e.g. assertion node M1 of Figure 4), the relation must be represented as a node, rather than as an arc. This also allows quantification over such relations, since in all node-based inference rule formalisms variables may only be substituted for nodes, not for arcs. Since the relation is a node, another node must be used to show the relationship of the relation to its arguments (e.g. nodes M2 and M3 in Figure q). Thus, Rq is really a first order rule derived from the second order rule</Paragraph> <Paragraph position="4"> by reducing r tO an individual variable and introducing a higher order relation, AVO, whose second argument is a conceptual relation and whose other arguments are conceptual individuals. So R4 is more accurately seen as the first order rule</Paragraph> <Paragraph position="6"> In this view, the predicates of semantic networks are not the nodes representing conceptual relations, but the different case frames. Rule Rq cannot be represented as a path-based rule because it is a rule about the relation AVO, and AVO is a trinary, rather than a binary relation.</Paragraph> <Paragraph position="7"> Although some node-based inference rules cannot be expressed by path-based inference rules, it is easy to see that any path-based inference rule can be expressed by a node-based inference rule, as long as we are willing to replace some arc-relations by nodes and higher order predicates.</Paragraph> </Section> <Section position="6" start_page="219" end_page="221" type="metho"> <SectionTitle> 5. Advantages of Path-Based Inference </SectionTitle> <Paragraph position="0"> The major advantage of path-based inference is efficiency. Carrying out a path-based inference involves merely checking that a specified path exists in the network between twogiven nodes (plus, perhaps, some side paths to specified nodes required by domain and range restrictions). This is a well understood and relatively efficient operation, especially compared to the backtracking, intersection, or unification operations required to check the consistency of variable substitutions in node-based inference rules. Moreover, path following seems to many people to be what semantic networks were originally designed for. The major search algorithm of Quillian's semantic Memory is a hi-directional search for a path connecting two nodes \[10, p. 249\].</Paragraph> <Paragraph position="1"> Also, the ability to do path tracing is a motivation underlying ISAhierarchies, and is why the Collins and Quillian results \[2\] gained such attention. These efficiencies are lost by replacing path-based inference rules by node-based inference rules.</Paragraph> </Section> <Section position="7" start_page="221" end_page="223" type="metho"> <SectionTitle> 6. Comb inin~ Path-Based and Node-Based Inference </SectionTitle> <Paragraph position="0"> We begin the task of unifying path-based and node-based inferences by noting the formal equivalence between an arc-relation and a two case case frame. Figure 5 illustrates this using ISA vs. SUB-SUP.</Paragraph> <Paragraph position="1"> Figure 5a shows the use of the \[SA arc-relation to represent that canaries are birds. Figure 5b represents the same relationship by a SUB-SUP case frame, and has the advantage that the relationship is represented by a node, M4. Figure 5c is a redrawing of 5b, using the arc label SUBto represent the relation SUB C. (It is generally understood in semantic network formalisms that whenever an arc representing a relation R goes from some node n to some node m, there is also an arc representing R c going from m to -). Figure 5c clarifies the notion that we may think of an instance of a two case case frame (such as M4) as both an arc and a node if we are willing to recalibrate the measurement of time it takes to follow one arc-relation to be the time it takes to follow two arcs. We can replace all instances of \[SA in the path-based inference rules of Section 2 by the composition SUB-/SUP and still have valid rules except that we now have paths on the left of the &quot;/&quot; symbol. valence of an arc-relation to a two case case frame, a) I~presenting set membership as the \[SA arc-relation, b) Repre- senting set membership as a SUB-SUP case frame, c) Redrawing (b) so it looks like (a}.</Paragraph> <Paragraph position="2"> Let us, therefore, extend our syntax of path-based inference rules to allow a path of arc compositions on the left of the &quot;/&quot; symbol. The rule ISA / ISA~ states that whenever there is a path of ISA arcs from node , to mode m, we can infer a &quot;virtual&quot; ISA arc directly from , to m which we may, if we wish, actually add to the network. Similarly, let the rule SUB-/SUP / (SUB-/SUP) ~ state that whenever a path of alternating SUB- and SUP arcs goes from node n to node m, we can infer a &quot;virtual&quot; node with SUB to n and SUP to m which we may, if we wish, actually add to the network.</Paragraph> <Paragraph position="3"> We now have a formalism for specifying path-based inference rules in a network formalism that represents binary conceptual relations by two case case frames.</Paragraph> <Paragraph position="4"> This would allow, for example, for a more unified representation in the SNIFFER system \[3\], in which node-based inference rules are implemented and built-in path based inference rules are used for set membership and set inclusion, both of which are represented only by arc-relations. The formalism presented here would allow set membership and set inclusion assertions to be represented by nodes, permitting other assertions to reference them, without giving up the efficiency of built-in routines to implement the set inclusion hierarchies.</Paragraph> <Paragraph position="5"> We desire, however, a more general unification of path-based and node-based inferences. There are two basic routines used to implement node-based inferences (although specific implementations may differ). One is the match routine that is given a pattern node and finds instances of it in the network, and the other is the routine that interprets the quantifiers and connectives to carry out the actual deduction. The match routine can be enhanced to make use of path-based inference rules. Consider a typical match routine used in the deduction in Section 3 of A SUPPORTS C from the network of Figure 4 and the rule of Figure 3, and let us introduce the notation that if P is a path of arcs and n is a node, P\[,\] represents the set of nodes found by following the path P from the node n. In the example, the match routine was given the pattern P4 to match in the binding It/SUPPORTS\].</Paragraph> <Paragraph position="6"> It was able to find MI by intersecting CLASSC\[TRAN~ITIVE\] with MEMBERC\[SuPPORTS\]. Now, let us suppose that the path-based inference rule CLASS / CLASS/ (SUB-/SUP) ~ has been declared in such a way that the match routine could use it. T~e match routine would intersect MEMBER~\[SUPPORTS\] wi~h ~LASS/(SUB-/SUP)*)C\[TRANSITIVE\] and be able to find a virtual node asserting that SUPPORTS is TRANSITIVE even if a long chain of set inclusions separated them.</Paragraph> <Paragraph position="7"> The proposal, therefore, is this: any arc-relation in a semantic network may be defined in terms of a path-based inference rule which the match routine is capable of using when finding instances of pattern nodes. This completes the general unification of path-based and node-based inference we desired. Since path-based inference is embedded in the match routine, while node-based inference requires the quantifier and connective interpreter, the difference is reminiscent of the difference between subconscious inference and conscious reasoning.</Paragraph> <Paragraph position="8"> 7. Application to Extensional</Paragraph> <Section position="1" start_page="222" end_page="223" type="sub_section"> <SectionTitle> Equivalence of Intensional Concepts </SectionTitle> <Paragraph position="0"> A basic assumption of semantic networks is that each concept is represented by a single node and that all information about a concept is reachable from its node.</Paragraph> <Paragraph position="1"> Yet, since Woods' discussion \[20\], most semantic network authors have agreed that a node represents an intensional, rather than an extensional concept. How should we handle the information that two different intensional concepts are extensionally equivalent? Let us illustrate this by a story (entirely fictional). For the last year we have heard of a renowned surgeon in town, Dr. Smith, known for his brilliance and dexterity, who saved the life of the famous actress Maureen Gelt by a difficult heart transplant operation. Meanwhile, at several social gatherings, we have met someone by the name of John Smith, about five feet, six inches tall, black hair and beard, generally disheveled and clumsy.</Paragraph> <Paragraph position="2"> We now discover, much to our amazement that John Smith and Dr. Smith are one and the same! In our semantic network, we have one node for Dr. Smith connected to his attributes, and another for John Smith connected to his attributes. What are we to do? Although we now know that John Smith saved the life of Maureen Gelt and that Dr. Smith has black hair, surely we cannot retrieve that information as fast as that Dr. Smith is a surgeon and that John Smith is 5'6&quot; tall. If we were to combine the two nodes by taking all the arcs from one node, tying them to the other and throwing away the first, we would lose this distinction. We must introduce an assertion, say an EQUIV-EQUIV case frame, that represents the fact that Dr. Smith and John Smith, different intensional concepts, are extensionally the same. I How are we to use this assertion? Ignoring for the moment referentially opaque contexts (&quot;We didn't know that John Smith was Dr. Smith.&quot;), how can we express the rule that if n EQUIV-/EQUIV m, than anything true of n is true of m? Our node based inference rules cannot express this rule because expressing &quot;anything true of n&quot; requires quantifying over those higher order case frame predicates such as AVO IThe psychological plausibility of this discussion is supported by the experiments of Anderson and Hastie \[I\] and of McNabb \[9\].</Paragraph> <Paragraph position="3"> and MEMBER-CLASS- One possibility is to use lambda abstraction as Schubert does \[12\]. Each n-ary higher order predicate involving some node becomes a unary predicate by replacing that node by a lambda variable. Thus, &quot;Dr. Smith saved Maureen Gelt's life&quot; becomes an instance of the unary predicate A(x)\[x saved Maureen Gelt's life\] applied to Dr. Smith. Using a PRED-ARG case frame, it is easy to represent the rule Vx,y,Z\[EQUIV-EQUIV(X,y) 6 PRED-ARG(X,z) + PRED-ARG(Y=,~)\]-The trouble with this solution is, how are we to retrieve this information as a fact about Maureen Gelt? Must we also store l(x) \[Dr. Smith saved x's life\] (Maureen Gelt)? This duplication is unsatisfying. An alternative is to include in the path-based inference rule defining each arc-relation the path (EQUIV-/EQUIV) ~. For example, AGENT / AGENT/(EQUIV-/EQUIV) ~, and CLASS + CLASS/((EQUIV-/EQUIV)*/(SUB-/SUP)*)*.</Paragraph> <Paragraph position="4"> Although this solution requires more rules than the lambda abstraction solution, and the rules look complicated, it avoids the duplication of the same assertion in different forms and the postulation of conceptual predicates such as l(x)\[x saved Maureen Gelt's life\].</Paragraph> <Paragraph position="5"> Hays' cognitive networks \[4;5\] include a scheme similar to the one proposed here. Each assertion about Dr. Smith would refer to a different node, each with an MST (manifestation) arc to a common node. This node would represent the in tension of Dr. Smith, while the others represent Dr. Smith as surgeon, Dr. Smith as saviour of Maureen Gelt, etc. Presumably, when Hays' network learns of the identity of Dr. Smith with John Smith, a new node is introduced with MS~ arcs from both Dr. Smith and John Smith. L Dr. Smith and John Smith are then seen as two manifestations of the newly integrated Dr.</Paragraph> <Paragraph position="6"> John Smith. Hays presumably uses an MST*/(MSTC)* path where we propose an (EQUIV-/EQUIV)* path.</Paragraph> <Paragraph position="7"> Blocking referentially opaque contexts seems to require introducing relationa~ oomplement. For any path P and nodes x and y, let z~y hold just in case a path P from = to ~ does not exist in the network. We might block referentially opaque contexts by includin~ the domain or range restriction (OBJ-/VERB/MEMBER-/CLASS OPAQUE) in the arc definitions.</Paragraph> <Paragraph position="8"> 8. Application tothe Explication of Inheritance As was mentioned in Section 2, many 2Actually, Hays' networks have not yet been implemented, and I have been warned \[R. Fritzson, personal communication\] that the implementation may differ from what I have supposed.</Paragraph> <Paragraph position="9"> semantic networks include inheritance (ISA) hierarchies. Often these are at best vague and at worst inconsistent. We propose that the inheritance properties of these hierarchies be clearly defined by path-based inference rules using the syntax we are presenting here or some other well defined syntax. We do not say that all systems should be able to input and interpret such rules, but only that authors use such rules to explain clearly to their readers how their hierarchies work.</Paragraph> <Paragraph position="10"> Before this proposal is feasible, we must be able to handle two more situations. The first is the exception principle, first expressed by Raphael \[11, p.85\] and succinctly stated by Winograd as, &quot;Any property true of a concept in the hierarchy is implicitly true of anything linked below it, unless explicitly contradicted at the lower level&quot; \[19, p.197\]. To allow for this, let us introduce an ezoep-Lion ope~a~o2.. If P and Q are paths and x and y are nodes, let zP\Qy hold just in case there is a path described by P from z to y and no path of equal or shorter length described by Q from x to y. To see that this suffices to handle the exception principle, consider the hierarchy of Figure 6, where, to make things more interesting, we have postulated a variety of flying penguins. We have also taken the liberty of explicitly representing that ~AN-FLY and CAN-NOT-FLY are negations of each other. The rule for inheritance in this hierarchy is The other situation we must discuss is &quot;almost transitive&quot; relations such as SIBLING- SIBLING is certainly symmetric, but it cannot be transitive since it is irreflexive. Yet your sibling's sibling is your sibling as long as he/she is not yourself. This is what we mean by &quot;almost transitive.&quot; Note that for any relation, R, R*~(R+) is the identity relation. Let us call it I. Then for any relation P, let pR be P&T. pR is the ~rrefZexiue mestri~%io, of P We can use this to define SIBLING as SIBLING + (SIBLINGvSIBLINGC) *R-We suggest that the syntax for path-based inference rules is now complete enough to explicate the inheritance rules of various hierarchies. The complete syntax will be summarized in the next section</Paragraph> </Section> </Section> class="xml-element"></Paper>