File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/84/p84-1044_metho.xml
Size: 16,061 bytes
Last Modified: 2025-10-06 14:11:37
<?xml version="1.0" standalone="yes"?> <Paper uid="P84-1044"> <Title>REPRESENTING KNOWLEDGE ABOUT KNOWLEDGE AND MUTUAL KNOWLEDGE</Title> <Section position="4" start_page="0" end_page="194" type="metho"> <SectionTitle> I INTRODUCTION </SectionTitle> <Paragraph position="0"> Speech act representation and the language planning require that the system can reason about intensional concepts like knowledge and belief. A problem resolver must understand the concept of knowledge and know for example what knowledge it needs to achieve specific goals. Our assumption is that a theory of language is part of a theory of action (Austin \[4\] ).</Paragraph> <Paragraph position="1"> Reasoning about knowledge encounters the problem of intensionality. One aspect of this problem is the indirect reference introduced by Frege ~\] during the last century. Mc Carthy \[15\] presents this problem by giving the following example : Let the two phrases : Pat knows Mike's telephone number (I) and Pat dialled Mike's telephone number (2) The meaning of the proposition &quot;Mike's telephone number&quot; in (I) is the concept of the telephone number, whereas its meaning in (2) is the number itself.</Paragraph> <Paragraph position="2"> Then if we have : &quot;Mary's telephone number = Mike's telephone number&quot;, we can deduce that : &quot;Pat dialled Mary's telephone number&quot; but we cannot deduce that : &quot;Pat knows Mary's telephone number&quot;, because Pat may not have known the equality mentioned above.</Paragraph> <Paragraph position="3"> Thus there are verbs like &quot;to know&quot;, &quot;to believe&quot; and &quot;to want&quot; that create an &quot;opaque&quot; context. For Frege a sentence is a name, reference of a sentence is its truth value, the sense of a sentence is the proposition. In an oblique context, the reference becomes the proposition. For example the referent of the sentence p in the indirect context &quot;A knows that p&quot; is a proposition and no longer a truth value.</Paragraph> <Paragraph position="4"> Me Carthy \[15\] and Konolige \[I I\] have adopted Frege's approach. They consider the concepts like objects of a first-order language. Thus one term will denote Mike's telephone number and another will denote the concept of Mike's telephone number. The problem of replacing equalities by equalities is then avoided because the concept of Mike's telephone number and the number itself are different entities.</Paragraph> <Paragraph position="5"> Mc Carthy's distinction concept/object corresponds to Frege's sense/reference or to modern logicians' intension/extension.</Paragraph> <Paragraph position="6"> Maida and Shapiro \[13\] adopt the same approach but use propositional semantic networks that are labelled graphs, and that only represent intenslons and not extensions, that is to say individual concepts and propositions and not referents and truth values. We bear in mind that a semantic network is a graph whose nodes represent individuals and whose oriented arcs represent binary relations.</Paragraph> <Paragraph position="7"> Cohen E6\], being interested in speech act planning, proposes the formalism of partitioned semantic networks as data base to represent an agent's beliefs. A partitioned semantic network is a labelled graph whose nodes and arcs are distributed into spaces. Every node or space is identified by its own label. Hendrix ~9\] introduced it to represent the situations requiring the delimitation of information sub-sets.</Paragraph> <Paragraph position="8"> In this way Cohen succeeds in avoiding the problems raised by the data base approach.</Paragraph> <Paragraph position="9"> These problems are clearly identified by Moore FI7,18\]. For example to represent 'A does-not believe P', Cohen asserts Believe (A,P) in a global data base, entirely separated from any agent's knowledge base. But as Appelt ~\] notes, this solution raised problems when one needs to combine facts from a particular data base with global facts to prove a single assertion. For example, from the assertion : know (John,Q) & know (John,P ~Q) where P~ Q is in John's data base and ~ know (John,Q) is in the global data base, it should be possible to conclude % know (John,P) but a good strategy must be found ! In a nutshell, in this first approach which we will call a syntactical one, an agent's beliefs are identified with formulas in a first-order language, and propositional attitudes are modelled as relations between an agent and a formula in the object language, but Montague showed that modalities cannot consistently be treated as predicates applying to nouns of propositions.</Paragraph> <Paragraph position="10"> The other approach no longer considers the intenslon as an object but as a function from possible worlds to entities. For instance the intension of a predicate P is the function which to each possible world W (or more generally a point of reference, see Scott \[23\] ) associates the extension of P in W.</Paragraph> <Paragraph position="11"> This approach is the one that Moore D7,18\] adopted. He gave a first-order axiomatization of Kripke's possible worlds semantics \[12\] for Hintikka's modal logic of knowledge \[,0\].</Paragraph> <Paragraph position="12"> The fundamental assumption that makes this translation possible, is that an attribution of any propositional attitude like &quot;to know&quot;, &quot;to believe&quot;, &quot;to remember&quot;, &quot;to strive&quot; entails a division of the set of possible worlds into two classes : the possible worlds that go with the propositional attitude that is considered, and those that are incompatible with it. Thus &quot;A knows that P&quot; is equivalent to &quot;P is true in every world compatible with what A knows&quot;.</Paragraph> <Paragraph position="13"> We think that possible worlds language is complicated and unintuitive, since, rather than reasoning directly about facts that someone knows, we reason about the possible worlds compatible with what he knows. This translation also presents some problems for the planning. For instance to establish that A knows that P, we must make P true in every world which is compatible with A's knowledge. This set of worlds is a potentially infinite set.</Paragraph> <Paragraph position="14"> The most important advantage of Moore's approach \[17,183 is that it gives a smart axiomatization of the interaction between knowledge and action.</Paragraph> </Section> <Section position="5" start_page="194" end_page="195" type="metho"> <SectionTitle> II PRESENTATION OF OUR APPROACH </SectionTitle> <Paragraph position="0"> Our approach is comprised in the general framework of the second approach, but instead of encoding Hintikka's modal logic of knowledge in a first-order language, we consider the logic of knowledge proposed by Mc Carthy, the decidability of which was proved by Sato \[21\] and we propose a prover of this logic, based on natural deduction.</Paragraph> <Paragraph position="1"> We bear in mind that the idea of using the modal logic of knowledge in A.I.</Paragraph> <Paragraph position="2"> was proposed for the first time by Mc Carthy and Hayes \[14\].</Paragraph> <Paragraph position="3"> A. Languages A language L is a triple (Pr,Sp,T) where : .Pr is the set of propositional variables, null .Sp is the set of persons, .T is the set of positive integers.</Paragraph> <Paragraph position="4"> The language of classical propositional calculus is L = (Pr,6,~). SoCSp will also be denoted by 0 and will be called &quot;FOOL&quot;.</Paragraph> <Paragraph position="5"> B. Well Formed Formulas The set of well formed formulas is defined to be the least set Wff such as : C. Hilbert-type System KT4 The axiom schemata for KT4 are : At. Axioms of ordinary propositional logic null A2. (St)a * a A3. (Ot)a ~ (Or) (St)a A4. (St) (a D b) ~ ((Su)a D(Su)b), where t 6 u A5. (St)a ~ (St) (St)a A6. If a is an axiom, then (St)a is an axiom.</Paragraph> <Paragraph position="6"> Now, we give the meaning of axioms : (A2) says that what is known is true, that is to say that it is impossible to have false knowledge. If P is false, we cannot say : &quot;John knows that P&quot; but we can say &quot;John believes that P&quot;. This axiom is the main difference between knowledge and belief. null This distinction is important for plan- null ning because when an agent achieves his goals, the beliefs on which he bases his actions must generally be true.</Paragraph> <Paragraph position="7"> (A3) says that what FOOL knows at time t, FOOL knows at time t that anyone knows it at time t. FOOL's knowledge represents universal knowledge, that is to say all agents knowledge.</Paragraph> <Paragraph position="8"> (A4) says that what is known will remain true and that every agent can apply modus ponens, that is, he knows all the logical consequences of his knowledge.</Paragraph> <Paragraph position="9"> (A5) says that if someone knows something then he knows that he knows it. This axiom is often required to reason about plans composed of several steps. It will be referred to as the positive introspective axiom.</Paragraph> <Paragraph position="10"> (A6) is the rule of inference.</Paragraph> <Paragraph position="11"> D. Representation of the notion of truth va- null lue.</Paragraph> <Paragraph position="12"> We give a great importance to the representation of the notion of truth value of a proposition, for example the utterance : John knows whether he is taller than Bill (I) can be considered as an assertion that mentions the truth value of the proposition P = John is taller than Bill, without taking a position as to whether the latter is true or false. In our formalism (I) is represented</Paragraph> <Paragraph position="14"> This disjunctive solution is also adopted by Allen and Perrault D\]&quot; Maida and Shapiro \[13\] represent this notion by a node because the truth value is a concept (an object of thought).</Paragraph> <Paragraph position="15"> The representation of the notion of truth value is useful to plan questions : A speaker can ask a hearer whether a certain proposition is true, if the latter knows whether this proposition is true.</Paragraph> <Paragraph position="16"> E. Representing definite descriptions in conversational systems : Let us consider a dialogue between two participants : A speaker S and a hearer H. The language is then reduced to : Sp = (O,H,S} and T = {l} Let P stand for the proposition : &quot;The description D in the context C is uniquely satisfied by E&quot;.</Paragraph> <Paragraph position="17"> Clark and Marshall \[5\] give examples that show that for S to refer to H to some entity E using some description D in a context C, it is sufficient that P is a mutual knowledge; this condition is tantamount to (O)P is provable. Perrault and Cohen \[20\] show that this condition is too strong. They claim that an infinite number of conjuncts are necessary for suc- null with only a finite number of false conjuncts. Finally, Nadathur and Joshi ~9\] give the following expression as sufficient condition for using D to refer to E : (S) BD (S)(H) P & ~ ((S) BO(S)~(O)P) where B is the conjunction of the set of sentences that form the core knowledge of S and ~ is the inference symbole.</Paragraph> </Section> <Section position="6" start_page="195" end_page="196" type="metho"> <SectionTitle> III SCHOTTE - TYPE SYSTEM KT4' </SectionTitle> <Paragraph position="0"> Gentzen's goal was to build a formalism reflecting most of the logical reasonings that are really used in mathemati- null cal proofs* He is the inventor of natural deduction (for classical and intultionistic logics). Sato ~|\] defines Gentzen - type sysmen GT4 which is equivalent to KT4. We consider here, schStte-type system KT4' \[22\] which is a generalization of S4 and equivalent to GT4 (and thus to KT4), in order to avoid the thinning rule of the system GT4 (which introduces a cumbersome combinatory). Firstly, we are going to give some difinitions to introduce KT4'.</Paragraph> <Paragraph position="1"> A. Inductive definition of positive and negative parts of a formula F Logical symbols are ~ and V.</Paragraph> <Paragraph position="2"> a. F is a positive part of F.</Paragraph> <Paragraph position="3"> b. If % A is a positive part of F, then A is a negative part of F.</Paragraph> <Paragraph position="4"> c. If ~ A is a negative part of F, then A is a positive part of F.</Paragraph> <Paragraph position="5"> d. If A V B is a positive part of F, then A and B are positive parts of F.</Paragraph> <Paragraph position="6"> Positive parts or negative parts which do not contain any other positive parts or negative parts are called minimal parts.</Paragraph> <Paragraph position="7"> B. Semantic property The truth of a positive part implies the truth of the formula which contains this positive part.</Paragraph> <Paragraph position="8"> The falsehood of a negative part implies the truth of the formula which contains this negative part.</Paragraph> <Paragraph position="9"> C. Notation F\[A+\] is a formula which contains A as a positive part F\[A-\] is a formula which contains A as a negative part.</Paragraph> <Paragraph position="10"> F\[A+,B-\] is a formula which contains A as a positive part and B as a negative part where A and B are disjoined (i.</Paragraph> <Paragraph position="11"> e, o~e is not a subformula of the o- ther). D. Inductive definition of F \[.j From a formula F \[A\], we build another formula or the empty formula F \[.\] by dele- null ting A : a. If F \[A 3 deg A, then F\[.\] is the empty formula.</Paragraph> <Paragraph position="12"> c. If F G\[A V BJ or = G V AJ then . = G \[BJ.</Paragraph> <Paragraph position="13"> E. Axiom An axiom is any formula of the form F\[P+,P-\] where P is a propositional variable. null formed into a KT4' proof-figure with the same conclusion and without any cut as a rule of inference (hence, the rule (R4) is superfluous. The proof of this theorem is an extension of Sch~tte's one for $4'. This theorem allows derivations &quot;without detour&quot;*</Paragraph> </Section> <Section position="7" start_page="196" end_page="198" type="metho"> <SectionTitle> IV DECISION PROCEDURE </SectionTitle> <Paragraph position="0"> A logical axiom is a formula of the form F\[P+,P-\]. A proof is an single-rooted tree of formulas all of whose leaves are logical axioms. It is grown upwards from the root, the rules (RI), (R2) and (R3) must be applied in a reverse sense.</Paragraph> <Paragraph position="1"> These reversal rules will be used as &quot;production rules&quot;* The meaning of each production expressed in terms of the progranting language PROLOG is an implication* It can be shown \[24J that the following strategy is a complete proof procedure : * The formula to prove is at the star- null ring node; * Queue the minimal parts in the given formula; null * Grow the tree by using the rule (R|) in priority , followed by the rule (R2), then by the rule (R3).</Paragraph> <Paragraph position="2"> The choice of the rule to apply can be done intelligently. In general, the choice of (RI) then (R2) increases the likelihood to find a proof because these (reversal) rules give more complex formulas. In the case where (R3) does not lead to a loss of formulas, it is more efficient to choose it at first* The following example is given to illustrate this strategy :</Paragraph> <Section position="1" start_page="197" end_page="198" type="sub_section"> <SectionTitle> Example </SectionTitle> <Paragraph position="0"> Take (A4) as an example and let Fo denotes its equivalent version in our language (Fo is at the start node) :</Paragraph> <Paragraph position="2"> Finally, we have to apply (R2) to the last but one node :</Paragraph> <Paragraph position="4"> is a logical axiom because P51~ F 5 =\[a} The generated derivation tree is then :</Paragraph> </Section> </Section> class="xml-element"></Paper>