<?xml version="1.0" standalone="yes"?> <Paper uid="W06-1319"> <Title>Balancing Conflicting Factors in Argument Interpretation</Title> <Section position="4" start_page="134" end_page="134" type="metho"> <SectionTitle> 2 Argument interpretation </SectionTitle> <Paragraph position="0"> We define an interpretation of a user's argument as the tuple {SC, IG, EE}, where SC is a supposition configuration, IG is an interpretation graph, and EE are explanatory extensions.</Paragraph> <Paragraph position="1"> A Supposition Configuration is a set of suppositions attributed to the user (in addition to or instead of shared beliefs) to account for the beliefs in his or her argument.</Paragraph> <Paragraph position="2"> An Interpretation Graph is a domain structure, in our case a subnet of the domain BN, that connects the nodes mentioned in the argument. The nodes and arcs that are included in an interpretation graph but were not mentioned by the user fill in additional detail from the BN, bridging inferential leaps in the argument.</Paragraph> <Paragraph position="3"> Explanatory Extensions are domain structures (subnets of the domain BN) that are added to an interpretation graph to justify an inference. Contrary to suppositions, these explanations contain propositions believed by the user and the system. The presentation of these explanations is motivated by the results of our early trials, where people objected to belief discontinuities between the antecedents and the consequent of inferences, i.e., increases in certainty or large changes in certainty (Zukerman and George, 2005).</Paragraph> <Paragraph position="4"> To illustrate these components, consider the example in Figure 1. The top segment contains a short argument, and the bottom segment contains its interpretation. The middle segment contains an excerpt of the domain BN which includes the interpretation; the probabilities of some nodes are indicated with linguistic terms.2 The interpretation graph, which appears inside a light gray bubble in the BN excerpt, includes the extra node GreenInGardenAtTimeOfDeath (boxed).</Paragraph> <Paragraph position="5"> Note that the propagated beliefs in this interpretation graph do not match those in the argument. To address this problem, the system supposes that the user believes that TimeOfDeath11=TRUE, instead of the BN belief of Probably (boldfaced and gray-boxed).</Paragraph> <Paragraph position="6"> 2 We use the terms Very Probable, Probable, Possible and their negations, and Even Chance. These terms, which are similar to those used in (Elsaesser, 1987), are most consistently understood by people according to our user surveys.</Paragraph> </Section> <Section position="5" start_page="134" end_page="134" type="metho"> <SectionTitle> ARGUMENT </SectionTitle> <Paragraph position="0"> Mr Green probably being in the garden at 11 implies that he possibly had the opportunity to kill Mr Body, but he possibly did not murder Mr Body.</Paragraph> </Section> <Section position="6" start_page="134" end_page="135" type="metho"> <SectionTitle> INTERPRETATION </SectionTitle> <Paragraph position="0"> Mr Green probably being in the garden at 11, and supposing that the time of death is 11, implies that Mr Green probably was in the garden at the time of death. Hence, he possibly had the opportunity to kill Mr Body, but Mr Green probably did not have the means. Therefore, he possibly did not murder Mr Body.</Paragraph> <Paragraph position="1"> Figure 1: An argument and its interpretation.</Paragraph> <Paragraph position="2">
This fixes the mismatch between the probabilities in the argument and those in the interpretation, but one problem remains: in early trials we found that people objected to belief discontinuities, such as the jump in belief from possibly having opportunity to possibly not murdering Mr Body (this jump appears both in the original argument and in the interpretation, whose beliefs now match those in the argument as a result of the supposition). This prompts the generation of the explanatory extension GreenHadMeans[ProbablyNot] (white boldfaced and dark-gray boxed). The three elements added during the interpretation process (the extra node in the interpretation graph, the supposition and the explanatory extension) appear in boldface italics in the interpretation at the bottom of the figure.</Paragraph> <Section position="1" start_page="134" end_page="135" type="sub_section"> <SectionTitle> 2.1 Proposing Interpretations </SectionTitle> <Paragraph position="0"> The problem of finding the best interpretation is exponential. In previous work, we proposed an anytime algorithm to propose interpretation graphs and supposition configurations until time runs out (George et al., 2004). Here we apply our algorithm to generate interpretations comprising supposition configurations (SC), interpretation graphs (IG) and explanatory extensions (EE) (Figure 2).</Paragraph> <Paragraph position="1"> Figure 2: Algorithm GenerateInterpretations(Arg). While {there is time} { 1. Propose a supposition configuration SC that accounts for the beliefs stated in the argument. 2. Propose an interpretation graph IG that connects the nodes in Arg under supposition configuration SC. 3. Propose explanatory extensions EE for interpretation graph IG under supposition configuration SC if necessary. 4. Calculate the probability of interpretation {SC, IG, EE}. 5. Retain the top N (=6) most probable interpretations. }</Paragraph> <Paragraph position="2"> Supposition configurations are proposed first, as instantiated beliefs affect the plausibility of interpretation graphs, which in turn affect the need for explanatory extensions. The proposal of supposition configurations, interpretation graphs and explanatory extensions is driven by the probability of these components. In each iteration, we generate candidates for a component, calculate the probability of these candidates in the context of the selections made in the previous steps, and probabilistically select one of these candidates. That is, higher probability candidates have a better chance of being selected than lower probability ones (our selection procedures are described in George et al., 2004). For example, say that in Step 1, we selected supposition configuration SCa. Next, in Step 2, the probability of candidate IGs is calculated in the context of the domain BN and SCa, and one of the IGs is probabilistically selected, say IGb. Similarly, in Step 3, one of the candidate EEs is selected in the context of SCa and IGb. In the next iteration, we probabilistically select an SC (which could be a previously chosen one), and so on. To generate diverse interpretations, if SCa is selected again, a different IG will be chosen.</Paragraph> </Section> </Section>
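As a concrete reading of Figure 2, the following Python sketch shows one way the anytime generate-and-select loop could be organized. The propose_scs, propose_igs, propose_ees and probability callables are hypothetical placeholders for the proposal and scoring procedures described in the text, not part of the paper's implementation.

```python
import heapq
import random
import time

def generate_interpretations(arg, bn, time_budget_s, n_best=6, *,
                             propose_scs, propose_igs, propose_ees, probability):
    """Anytime sketch of Figure 2: generate interpretations {SC, IG, EE}.

    propose_scs/propose_igs/propose_ees return lists of (candidate, prob)
    pairs, and probability() scores a complete interpretation; all four are
    placeholders for the domain-specific procedures described in the paper.
    """
    def pick(candidates):
        # Probabilistic selection: higher-probability candidates are more
        # likely to be chosen, but lower-probability ones keep some chance.
        items, weights = zip(*candidates)
        return random.choices(items, weights=weights, k=1)[0]

    best = []                                            # min-heap of (prob, counter, interp)
    counter = 0
    deadline = time.monotonic() + time_budget_s
    while time.monotonic() < deadline:
        sc = pick(propose_scs(arg, bn))                  # Step 1
        ig = pick(propose_igs(arg, bn, sc))              # Step 2
        ee = pick(propose_ees(arg, bn, sc, ig))          # Step 3
        p = probability(arg, bn, sc, ig, ee)             # Step 4
        # Step 5: retain the top N most probable interpretations.
        # (The bookkeeping that forces a different IG when an SC is
        # re-selected is omitted from this sketch.)
        heapq.heappush(best, (p, counter, (sc, ig, ee)))
        counter += 1
        if len(best) > n_best:
            heapq.heappop(best)
    return [interp for _, _, interp in sorted(best, reverse=True)]
```

Because selection is weighted rather than greedy, lower-probability candidates are occasionally explored, which is what lets the procedure escape a locally attractive but globally poor supposition configuration.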
<Section position="7" start_page="135" end_page="139" type="metho"> <SectionTitle> 3 Probabilistic formalism </SectionTitle> <Paragraph position="0"> Following (Wallace, 2005), our approach requires the specification of three elements: background knowledge, model and data. Background knowledge is everything known to the system prior to interpreting a user's argument, e.g., domain knowledge, shared beliefs with the user, and dialogue history; the data is the argument; and the model is the interpretation.</Paragraph> <Paragraph position="1"> We posit that the best interpretation is that with the highest posterior probability, i.e., the interpretation that maximizes Pr(Mi|D) for i = 1, ..., q, where q is the number of interpretations. After applying Bayes rule, this probability is represented as follows:3 Pr(Mi|D) = α Pr(Mi) × Pr(D|Mi)  (1), where α is a normalizing constant that ensures that the probabilities of the interpretations sum to 1 (α = 1 / Σ_{j=1..q} Pr(Mj) Pr(D|Mj)). The first factor represents model complexity, and the second factor represents data fit.</Paragraph> <Paragraph position="2"> 3 This probability could also be calculated directly. However, it is not clear how to incorporate the priors of an interpretation in the direct calculation.</Paragraph> <Paragraph position="3"> Model complexity measures how difficult it is to produce the model (interpretation) from the background knowledge. The higher/lower the complexity of a model, the lower/higher its probability.</Paragraph> <Paragraph position="4"> Data fit measures how well the data (argument) matches the model (interpretation). The better/worse the match between the argument and an interpretation, the higher/lower the probability that the speaker intended this interpretation when he or she uttered the argument.</Paragraph> <Section position="1" start_page="135" end_page="136" type="sub_section"> <SectionTitle> Model Complexity </SectionTitle> <Paragraph position="0"> Model complexity is a function {B, M} → [0, 1] that represents the prior probability of the model M (i.e., the interpretation) in terms of the background knowledge B. The calculation of model complexity depends on the type of the model: numerical or structural.</Paragraph> <Paragraph position="1"> The probability of a numerical model depends on the similarity between the numerical values (or distributions) in the model and those in the background knowledge. The higher/lower this similarity, the higher/lower the probability of the model. For instance, a supposition configuration SC comprising beliefs that differ significantly from those in the background knowledge will lower the probability of an interpretation. One of the functions we have used to calculate belief probabilities is the Zipf distribution, where the parameter is the difference between beliefs, e.g., between the supposed beliefs and the corresponding beliefs in the background knowledge (Zukerman and George, 2005). That is, the probability of a supposed belief in proposition P according to model M (bel M(P)), in light of the belief in P according to background knowledge B (bel B(P)), is Pr(bel M(P)|bel B(P)) = θ |bel M(P) - bel B(P)|^(-γ), where θ is a normalizing constant, and γ determines the penalty assigned to the discrepancy between the beliefs in P. For example, Pr(bel M(P)=TRUE | bel B(P)=Probable) > Pr(bel M(P)=TRUE | bel B(P)=EvenChance), as TRUE is closer to Probable than to EvenChance.</Paragraph>
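The following sketch illustrates a Zipf-style discrepancy penalty of this kind, assuming the linguistic belief terms are mapped to equally spaced positions on a scale (with TRUE appended at the top for instantiated beliefs); the numeric mapping, the (difference + 1) form and the value of gamma are illustrative assumptions, not the paper's exact parameters.

```python
# Linguistic belief scale (Section 2, footnote 2), mapped to positions 0..7
# for illustration; the exact numeric coding is an assumption of this sketch.
BELIEF_SCALE = ["VeryProbablyNot", "ProbablyNot", "PossiblyNot", "EvenChance",
                "Possibly", "Probably", "VeryProbably", "TRUE"]

def belief_discrepancy_probability(supposed, background, gamma=2.0):
    """Zipf-style penalty on the distance between a supposed belief and the
    corresponding belief in the background knowledge."""
    diff = abs(BELIEF_SCALE.index(supposed) - BELIEF_SCALE.index(background))
    # Unnormalized score: identical beliefs get the highest score, and the
    # score decays polynomially with the discrepancy (diff + 1 keeps it finite).
    score = (diff + 1) ** -gamma
    # Normalize over all possible supposed beliefs so the scores sum to 1
    # (this plays the role of the constant theta).
    total = sum((abs(i - BELIEF_SCALE.index(background)) + 1) ** -gamma
                for i in range(len(BELIEF_SCALE)))
    return score / total

# TRUE is closer to Probably than to EvenChance, so the first supposition
# receives the higher probability:
assert belief_discrepancy_probability("TRUE", "Probably") > \
       belief_discrepancy_probability("TRUE", "EvenChance")
```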
<Paragraph position="2"> The probability of a structural model (e.g., an interpretation graph) is obtained from the probabilities of the elements in the structure (e.g., nodes and arcs) in light of the background knowledge. The simplest calculation assumes that the probability of including nodes and arcs in an interpretation graph is uniform. That is, the probability of an interpretation graph comprising n nodes and a arcs is a function of the probability of n, the probability of selecting n particular nodes from the N nodes in the domain BN, i.e., (N choose n)^(-1), the probability of a, and the probability of selecting a particular arcs from the arcs that connect the n selected nodes. This calculation generally prefers small models to larger models.4</Paragraph> <Paragraph position="3"> 4 In the rare cases where n > N/2, smaller models do not yield lower probabilities.</Paragraph> <Paragraph position="4"> Data Fit. Data fit is a function {M, D} → [0, 1] that represents the probability of the data D (argument) given the model M (interpretation). This probability hinges on the similarity between the model and the data: the closer the data is to the model, the higher is the probability of the data. The calculation of the similarity between numerical data and a numerical model is the same as the calculation of the similarity between a numerical model and background knowledge.</Paragraph> <Paragraph position="5"> The similarity between structural data and a structural model is a function of the number and type of operations required to convert the model into the data, e.g., node and arc insertions and deletions. For the example in Figure 1, to convert the interpretation graph into the argument, we must delete one node (GreenInGardenAtTimeOfDeath) and its incident arcs. The more operations that need to be performed, the lower the similarity between the data and the model, and the lower the probability of the data given the model.</Paragraph> <Paragraph position="6"> We now discuss our basic probabilistic formalism, which accounts for interpretation graphs, followed by two enhancements: (1) a more complex model that accounts for suppositions; and (2) increases in background knowledge that yield a preference for larger interpretation graphs under certain circumstances, and account for explanatory extensions.</Paragraph> </Section> <Section position="2" start_page="136" end_page="137" type="sub_section"> <SectionTitle> 3.1 Basic formalism: Interpretation graphs </SectionTitle> <Paragraph position="0"> In the basic formalism, the model contains only an interpretation graph. Thus, Equation 1 is simply Pr(IGi|Arg) = α Pr(IGi) × Pr(Arg|IGi). The difference in the calculations of model complexity and data fit for numerical and structural information warrants the separation of structure and belief, which yields Pr(IGi|Arg) = α Pr(struc IGi) × Pr(bel IGi|struc IGi) × Pr(Arg|IGi). Note that Pr(bel IGi|struc IGi) does not calculate the probability of (or belief in) the nodes in IGi. Rather, it calculates how probable these beliefs are in light of the structure of IGi and the expectations from the background knowledge. For instance, if the belief in a node is p, it calculates the probability of p. This probability depends on the closeness between the beliefs in IGi and the expected ones. Since the beliefs in IGi are obtained algorithmically by means of Bayesian propagation from the background knowledge, they match the expectations precisely. Hence, Pr(bel IGi|struc IGi) = 1.</Paragraph> <Paragraph position="1"> We also make the following simplifying assumptions for situations where the interpretation is known (given): (1) the probability of the beliefs in the argument depends only on the beliefs in the interpretation (and not on its structure or the argument's structure), and (2) the probability of the argument structure depends only on the interpretation structure (and not on its beliefs).
This yields Pr(IGi|Arg) = α Pr(struc IGi) × Pr(bel IGi|struc IGi) × Pr(bel Arg|bel IGi) × Pr(struc Arg|struc IGi)  (2).</Paragraph> <Paragraph position="2"> Table 1 summarizes the calculation of these probabilities, separated according to model complexity and data fit. It also shows the trade-off between structural model complexity and structural data fit. As seen at the start of Section 3, smaller structures generally have a lower model complexity than larger ones. However, an increase in structural model complexity (indicated by the ↑ next to the structural complexity and the ↓ next to the resultant probability of the model) may reduce the structural discrepancy between the argument structure and the structure of the interpretation graph (indicated by the ↓ next to the structural discrepancy and the ↑ next to the probability of the structural data fit). For instance, the smallest possible interpretation for the argument in Figure 1 consists of a single node, but this interpretation has a very poor data fit with the argument.</Paragraph> </Section> <Section position="3" start_page="137" end_page="137" type="sub_section"> <SectionTitle> 3.2 A more informed model </SectionTitle> <Paragraph position="0"> In order to postulate suppositions that account for the beliefs in an argument, we expand the basic model to include supposition configurations (beliefs attributed to the user in addition to or instead of the beliefs shared with the system). Now the model comprises the pair {SCi, IGi}, and Equation 2 becomes Pr(SCi, IGi|Arg) = α Pr(SCi) × Pr(struc IGi|SCi) × Pr(bel IGi|SCi, struc IGi) × Pr(bel Arg|SCi, bel IGi) × Pr(struc Arg|struc IGi). (Recall that suppositions pertain to beliefs only, i.e., they don't have a structural component.)</Paragraph> <Paragraph position="1"> Table 2 summarizes the calculation of these probabilities, separated according to model complexity and data fit (the elements that differ from the basic model are boldfaced). It also shows the trade-off between belief model complexity and belief data fit. Making suppositions has a higher model complexity (lower probability) than not making suppositions (where SCi matches the beliefs in the domain BN). However, as seen in the example in Figure 1, making a supposition that reduces or eliminates the discrepancy between the beliefs in the argument and those in the interpretation increases the belief data fit considerably, at the expense of a more complex belief model.</Paragraph> </Section>
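A minimal sketch of how these factors might be combined is given below; the uniform-selection structural prior and the edit-operation data fit follow the descriptions above, while the helper names, the size priors p_n and p_a, and the decay constant beta are assumptions of the sketch.

```python
from math import comb

def structural_prior(n_nodes, n_arcs, bn_nodes, candidate_arcs,
                     p_n=0.5, p_a=0.5):
    """Uniform-selection structural complexity of an interpretation graph:
    smaller graphs generally receive a higher probability. p_n and p_a stand
    in for the (unspecified) priors on the number of nodes and arcs."""
    return (p_n / comb(bn_nodes, n_nodes)) * (p_a / comb(candidate_arcs, n_arcs))

def structural_data_fit(ig_nodes, arg_nodes, beta=0.5):
    """Toy structural data fit: penalize each node that must be deleted from
    (or added to) the interpretation graph to recover the argument."""
    operations = len(set(ig_nodes) ^ set(arg_nodes))   # symmetric difference
    return beta ** operations

def interpretation_posterior(sc, ig, arg, bn,
                             pr_sc, pr_struc_ig, pr_bel_arg, pr_struc_arg):
    """Unnormalized posterior of an interpretation {SC, IG} given an argument,
    mirroring the factorization in Section 3.2:
        Pr(SC) * Pr(struc IG | SC) * Pr(bel IG | SC, struc IG)
              * Pr(bel Arg | SC, bel IG) * Pr(struc Arg | struc IG)
    The pr_* callables are placeholders for the component calculations."""
    p = pr_sc(sc, bn)                       # belief model complexity
    p *= pr_struc_ig(ig, bn, sc)            # structural model complexity
    # Pr(bel IG | SC, struc IG) = 1: beliefs in IG come from Bayesian
    # propagation, so they match the expectations exactly (Section 3.1).
    p *= pr_bel_arg(arg, sc, ig)            # belief data fit
    p *= pr_struc_arg(arg, ig)              # structural data fit
    return p
```

Under this scheme a supposition configuration or a larger graph pays an up-front complexity cost that must be recovered through a better belief or structural fit, which is exactly the trade-off summarized in Tables 1 and 2.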
<Section position="4" start_page="137" end_page="138" type="sub_section"> <SectionTitle> 3.3 Additional background knowledge </SectionTitle> <Paragraph position="0"> An increase in our background knowledge means that we take into account additional factors about the world. This extra knowledge in turn may cause us to prefer interpretations that were previously discarded. We have considered two additions to background knowledge: dialogue history, and users' preferences regarding inference patterns.</Paragraph> <Paragraph position="1"> Dialogue history. Dialogue history influences the salience of a node, and hence the probability that it was included in a user's argument. We have modeled salience by means of an activation function that decays with time (Anderson, 1983), and used this function to moderate the probability of including a node in an interpretation (instead of using a uniform distribution). We have experimented with two activation functions: (1) a function where the level of activation of a node is based on the frequency and recency of the direct activation of this node; and (2) a function where the level of activation of a node depends on its similarity with all the (activated) nodes, together with the frequency and recency of their activation (Zukerman and George, 2005).</Paragraph> <Paragraph position="2"> To illustrate the influence of salience, compare the preferred interpretation graph in Figure 1 (in the light gray bubble) with an alternative path through NbourHeardGreen&BodyArgueLastNight and GreenVisitBodyLastNight. The preferred path has 4 nodes, while the alternative one has 5 nodes, and hence a lower probability. However, if the nodes in the longer path had been recently mentioned, their salience could overcome the size disadvantage.</Paragraph> <Paragraph position="3"> Thus, although the chosen interpretation graph may have a worse data fit than the smallest graph, it still may have the best overall probability in light of the additional background knowledge.</Paragraph> </Section>
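The sketch below shows a simple frequency- and recency-based activation function in the spirit of the first option, and how it could replace the uniform node-inclusion probability; the exponential decay form, its rate and the base score are assumptions rather than the functions used in the paper.

```python
from math import exp

def activation(mention_times, now, decay=0.5):
    """Frequency- and recency-based activation of a node: each past mention
    contributes a term that decays with the time elapsed since the mention
    (the exponential decay form and rate are assumptions of this sketch)."""
    return sum(exp(-decay * (now - t)) for t in mention_times if t <= now)

def inclusion_probability(node, domain_nodes, dialogue_history, now, base=1.0):
    """Moderate the probability of selecting a node for an interpretation by
    its salience, instead of a uniform 1/N distribution over domain nodes.
    dialogue_history maps a node to the times at which it was mentioned."""
    def score(n):
        return base + activation(dialogue_history.get(n, []), now)
    return score(node) / sum(score(n) for n in domain_nodes)
```

With such a scheme, nodes mentioned recently in the dialogue receive a larger share of the inclusion probability, which is how the longer path in the example above can overtake the shorter one.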
<Section position="5" start_page="138" end_page="139" type="sub_section"> <SectionTitle> Inference patterns </SectionTitle> <Paragraph position="0"> In a formative evaluation of an earlier version of our system, we found that people objected to inferences that had increases in certainty or large changes in certainty (Zukerman and George, 2005). An example of an increase in certainty is A [Probably] implies B [VeryProbably]. A large change in certainty is illustrated by A [VeryProbably] implies B [EvenChance].</Paragraph> <Paragraph position="1"> We then conducted another survey to determine the types of inferences considered acceptable by people (from the standpoint of the beliefs in the antecedents and the consequent). The results from our preliminary survey prompted us to distinguish between three types of inferences: BothSides, SameSide and AlmostSame.</Paragraph> <Paragraph position="2"> BothSides inferences have antecedents with beliefs on both sides of the consequent (in favour and against), e.g., A[VeryProbably] & B[ProbablyNot] implies C[EvenChance].</Paragraph> <Paragraph position="3"> All the antecedents in SameSide inferences have beliefs on one side of the consequent, but at least one antecedent has the same belief level as the consequent, e.g., A[VeryProbably] & B[Possibly] implies C[Possibly].</Paragraph> <Paragraph position="4"> All the antecedents in AlmostSame inferences have beliefs on one side of the consequent, but the closest antecedent is one level up from the consequent, e.g., A[VeryProbably] & B[Possibly] implies C[EvenChance].</Paragraph> <Paragraph position="5"> Our survey contained six evaluation sets, which were done by 50 people. Each set contained an initial statement (we varied the polarity of the statement in the various sets), three alternative arguments that explain this statement, and the option to say that no argument is a good explanation. The respondents were asked to rank these options in order of preference.</Paragraph> <Paragraph position="6"> All the evaluation sets contained one argument that was objectionable according to our preliminary survey (there was an increase in belief or a large change in belief from the antecedent to the consequent). The two other arguments, each of which comprises a single inference, were distributed among the six evaluation sets as follows. Three sets had one BothSides inference and one SameSide inference, each with two antecedents. Two sets had one SameSide inference and one AlmostSame inference, each with two antecedents. One set had one SameSide inference with two antecedents, and one BothSides inference comprising three antecedents.</Paragraph> <Paragraph position="7"> In order to reduce the effect of the respondents' domain bias, we generated two versions of the survey, where for each evaluation set we swapped the antecedent propositions in one of the inferences with the antecedent propositions in the other. Our survey showed that people prefer BothSides inferences (which contain antecedents for and against the consequent). They also prefer SameSide to AlmostSame for antecedents with beliefs in the negative range (VeryProbNot, ProbNot and PossNot), and they did not distinguish between SameSide and AlmostSame for antecedents with beliefs in the positive range. Further, BothSides inferences with three antecedents were preferred to SameSide inferences with two antecedents. This indicates that persuasiveness carries more weight than parsimony.</Paragraph> <Paragraph position="8"> These general preferences are incorporated into our background knowledge as expectations for a range of acceptable beliefs in the consequents of inferences in light of their antecedents. The farther the actual beliefs in the consequents are from the expectations, the lower the probability of these beliefs. Hence, it is no longer true that Pr(bel IGi|SCi, struc IGi) = 1 (Section 3.1), as we now have a belief expectation that goes beyond Bayesian propagation. As at the start of Section 3, the probability of the beliefs in an interpretation is a function of the discrepancy between these beliefs and the expected beliefs. We calculate this probability using a variant of the Zipf distribution adjusted for ranges of beliefs.</Paragraph>
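A small sketch of how an inference could be classified into these types from the belief levels of its antecedents and consequent is shown below; the integer coding of the belief scale and the treatment of larger jumps as a catch-all Other category are assumptions of the sketch.

```python
# Belief levels centred on EvenChance (0); positive values favour the
# consequent's proposition, negative values count against it. The numeric
# coding is an assumption of this sketch.
LEVEL = {"VeryProbablyNot": -3, "ProbablyNot": -2, "PossiblyNot": -1,
         "EvenChance": 0, "Possibly": 1, "Probably": 2, "VeryProbably": 3}

def classify_inference(antecedents, consequent):
    """Classify an inference by comparing the antecedent beliefs with the
    consequent belief: BothSides, SameSide, AlmostSame, or Other."""
    a = [LEVEL[b] for b in antecedents]
    c = LEVEL[consequent]
    if any(x > c for x in a) and any(x < c for x in a):
        return "BothSides"          # antecedents on both sides of the consequent
    if min(abs(x - c) for x in a) == 0:
        return "SameSide"           # some antecedent at the consequent's level
    if all(x > c for x in a) and min(a) - c == 1:
        return "AlmostSame"         # closest antecedent one level above
    return "Other"                  # e.g., increases or larger changes in certainty

# The examples from the text:
assert classify_inference(["VeryProbably", "ProbablyNot"], "EvenChance") == "BothSides"
assert classify_inference(["VeryProbably", "Possibly"], "Possibly") == "SameSide"
assert classify_inference(["VeryProbably", "Possibly"], "EvenChance") == "AlmostSame"
assert classify_inference(["Probably"], "VeryProbably") == "Other"  # increase in certainty
```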
<Paragraph position="9"> Explanatory extensions are added to an interpretation in order to overcome these belief discrepancies, yielding an expanded model that comprises the tuple {SCi, IGi, EEi}. Equation 2 now becomes Pr(SCi, IGi, EEi|Arg) = α Pr(SCi, IGi, EEi) × Pr(Arg|SCi, IGi, EEi). We make simplifying assumptions similar to those made in Section 3.1, i.e., given the interpretation graph and supposition configuration, the beliefs in the argument depend only on the beliefs in the interpretation, and the argument structure depends only on the interpretation structure. These assumptions, together with probabilistic manipulations similar to those performed in Section 3.1, yield Pr(SCi, IGi, EEi|Arg) = α Pr(SCi) × Pr(struc IGi|SCi) × Pr(bel IGi|SCi, struc IGi, bel EEi, struc EEi) × Pr(struc EEi|SCi, struc IGi, bel EEi) × Pr(bel EEi|SCi, struc IGi, struc EEi) × Pr(bel Arg|SCi, bel IGi) × Pr(struc Arg|struc IGi).</Paragraph> <Paragraph position="10"> The calculation of the probability of an explanatory extension is the same as the calculation for structural model complexity at the start of Section 3. However, the nodes in an explanatory extension are selected from the nodes directly connected to the interpretation graph. In addition, as for the basic model (Section 3.1), the beliefs in the nodes in explanatory extensions are obtained algorithmically by means of Bayesian propagation. Hence, there is no discrepancy with expected beliefs, i.e., Pr(bel EEi|SCi, struc IGi, struc EEi) = 1.</Paragraph> <Paragraph position="11"> Table 3 summarizes the calculation of these probabilities (the elements that differ from the basic model and the enhanced model are boldfaced). It also shows the trade-off between structural and belief model complexity. Presenting explanatory extensions has a higher structural complexity (lower probability) than not presenting them. However, explanatory extensions can reduce the numerical discrepancy between the beliefs in an interpretation and the beliefs expected from the background knowledge, thereby increasing the belief probability of the interpretation. For instance, in the example in Figure 1, the added explanatory extension eliminates the unacceptable jump in belief.</Paragraph> <Paragraph position="12"> Table 4 summarizes the trade-offs discussed in this section: ↓ Pr model structure (IG) ⇒ ↑ Pr structural data fit; ↓ Pr model belief (SC) ⇒ ↑ Pr belief data fit; ↓ Pr model structure (EE) ⇒ ↑ Pr model belief.</Paragraph> </Section> </Section>
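Building on the classifier above, the following sketch suggests one way explanatory extensions could be proposed: candidate nodes are drawn from the BN nodes directly connected to the interpretation graph, and a candidate set is kept if adding it as extra antecedents removes the belief discontinuity. The accessors ig.inferences(), ig.nodes() and bn.neighbours(), the beliefs map (node → linguistic belief obtained by Bayesian propagation) and the acceptance test itself are hypothetical.

```python
from itertools import combinations

def propose_explanatory_extensions(ig, bn, beliefs, max_nodes=2):
    """Propose candidate explanatory extensions for an interpretation graph:
    small sets of BN nodes adjacent to the graph whose inclusion as extra
    antecedents turns an objectionable inference into a BothSides or
    SameSide one."""
    candidates = []
    for antecedents, consequent in ig.inferences():          # assumed accessor
        kind = classify_inference([beliefs[n] for n in antecedents],
                                  beliefs[consequent])
        if kind in ("BothSides", "SameSide"):
            continue                                          # no extension needed
        # Consider only nodes directly connected to the interpretation graph.
        frontier = {m for n in list(antecedents) + [consequent]
                    for m in bn.neighbours(n)} - set(ig.nodes())
        for k in range(1, max_nodes + 1):
            for extra in combinations(sorted(frontier), k):
                extended = ([beliefs[n] for n in antecedents] +
                            [beliefs[m] for m in extra])
                if classify_inference(extended, beliefs[consequent]) in \
                        ("BothSides", "SameSide"):
                    candidates.append((consequent, extra))
    return candidates
```

In the Figure 1 example, adding GreenHadMeans[ProbablyNot] alongside the Possibly antecedent brackets the PossiblyNot consequent from both sides, turning the objectionable jump into a BothSides inference.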
<Section position="8" start_page="139" end_page="140" type="metho"> <SectionTitle> 4 Evaluation </SectionTitle> <Paragraph position="0"> We evaluated each component of an interpretation separately: interpretation graph, supposition configuration and explanatory extensions.</Paragraph> <Section position="1" start_page="139" end_page="140" type="sub_section"> <SectionTitle> 4.1 Interpretation graph </SectionTitle> <Paragraph position="0"> We prepared four evaluation sets, each of which was done by about 20 people (Zukerman and George, 2005). In three of the sets, the participants were given a simple argument and a few candidate interpretations (ranked highly by our system). The fourth set featured a complex argument, and only one interpretation (other candidates had much lower probabilities). The participants were asked to give each interpretation a score between 1 (Very UNreasonable) and 5 (Very reasonable).</Paragraph> <Paragraph position="1"> Table 5 shows the results obtained for the interpretation selected by our formalism for each set, which was the top scoring interpretation. The first row shows the average score given by our subjects to this interpretation, the second row shows the standard deviation, and the third row the statistical significance, derived using a paired Z-test against alternative options (no alternatives were presented for the fourth set). Table 5 (Sets 1-4, excerpt): Std. dev. 1.45, 1.11, 1.39, 1.02; Stat. sig. (p) 0.08, 0.15, 0.07, NA. Our results show that the interpretations generated by our system were generally acceptable, but that some people gave low scores. Our subjects' feedback indicated that these scores were mainly due to mismatches between beliefs in the argument and in its interpretation, and due to belief discontinuities. This led to the addition of suppositions and explanatory extensions.</Paragraph> </Section> <Section position="2" start_page="140" end_page="140" type="sub_section"> <SectionTitle> 4.2 Supposition configuration </SectionTitle> <Paragraph position="0"> We prepared four evaluation sets, each of which was done by 34 people (George et al., 2005). Each set consisted of a short argument, plus a list of supposition options as follows: (a) four suppositions that had a reasonably high probability according to our formalism, (b) the option to make a free-form supposition in line with the domain BN, and (c) the option to suppose nothing. We then asked our subjects to indicate which of these options was required for the argument to make sense. Specifically, they had to rank their preferred options in order of preference (but they did not have to rank options they disliked). Overall, there was strong support for the supposition preferred by our formalism. In three of the evaluation sets, it was ranked first by most of the trial subjects (30/34, 19/34, 20/34), with no other option a clear second.</Paragraph> <Paragraph position="1"> Only in the fourth set was the supposition preferred by our formalism equal-first with another option, but it was still ranked first 10 times (out of 34).</Paragraph> </Section> <Section position="3" start_page="140" end_page="140" type="sub_section"> <SectionTitle> 4.3 Explanatory extensions </SectionTitle> <Paragraph position="0"> We constructed two evaluation sets, each of which was done by 20 people. Each set consisted of a short argument and two alternative interpretations (with and without explanatory extensions). There was strong support for the explanatory extensions proposed by our formalism, with 57.5% of our trial subjects favouring the interpretations with explanatory extensions, compared to 37.5% of the subjects who preferred the interpretations without such extensions, and 5% who were indifferent.</Paragraph> </Section> </Section> <Section position="9" start_page="140" end_page="141" type="metho"> <SectionTitle> 5 Related Research </SectionTitle> <Paragraph position="0"> An important aspect of discourse understanding involves filling in information that was omitted by the interlocutor. In this paper, we have presented a probabilistic formalism that balances conflicting factors when filling in three types of information omitted from an argument. Interpretation graphs fill in details in the argument's inferences, supposition configurations make sense of the beliefs in the argument, and explanatory extensions overcome belief discontinuities.</Paragraph> <Paragraph position="1"> Our approach resembles the work of Hobbs et al. (1993) in several respects. They employed an abductive approach where a model (interpretation) is inferred from evidence (sentence); they made assumptions as necessary; and they used guiding criteria pertaining to the model and the data for choosing between candidate models. There are also significant differences between our work and theirs. Their interpretation focused on problems of reference and disambiguation in single sentences, while ours focuses on a longer discourse and the relations between the propositions therein. This distinction also determines the nature of the task, as they try to find a concise model that explains as much of the data as possible (e.g., one referent that fits many clues), while we try to find a representation for a user's argument. Additionally, their domain knowledge is logic-based, while ours is Bayesian; and they used weights to apply their hypothesis selection criteria, while our criteria are embodied in a probabilistic framework.</Paragraph> <Paragraph position="2"> Plan recognition systems also generate one or more interpretations of a user's utterances, employing different resources to fill in information omitted by the user, e.g., (Allen and Perrault, 1980; Litman and Allen, 1987; Carberry and Lambert, 1999; Raskutti and Zukerman, 1991). These plan recognition systems used a plan-based approach to propose interpretations. The first three systems applied different types of heuristics to select an interpretation, while the fourth system used a probabilistic approach moderated by heuristics to select the interpretation with the highest probability.
We use a probabilistic domain representation in the form of a BN (rather than plan libraries), and apply a probabilistic mechanism that represents explicitly the contribution of background knowledge, model complexity and data fit to the generation of an interpretation. Our mechanism, which can be applied to other domain representations, balances different types of complexities and discrepancies to select the interpretation with the highest posterior probability.</Paragraph> <Paragraph position="3"> Several researchers used maximum posterior probability as the criterion for selecting an interpretation (Charniak and Goldman, 1993; Gertner et al., 1998; Horvitz and Paek, 1999). They used BNs to represent a probability distribution over the set of possible explanations for the observed facts, and selected the explanation (a node in the BN or a value of a node) with the highest probability. We also use BNs as our domain representation, but our explanation of the facts (the user's argument) is a Bayesian subnet (rather than a single node) supplemented by suppositions. Additionally, we calculate the probability of an interpretation on the basis of the fit between the argument and the interpretation, and the complexity of the interpretation in light of the background knowledge.</Paragraph> <Paragraph position="4"> Our work on positing suppositions is related to research on presuppositions (Kaplan, 1982; Gurney et al., 1997), a type of supposition implied by the wording of a statement. Like our suppositions, presuppositions are necessary to make sense of what is being said, but they operate at a different knowledge level than our suppositions.</Paragraph> <Paragraph position="5"> This aspect of our work is also related to research on the recognition of flawed plans (Quilici, 1989; Pollack, 1990; Chu-Carroll and Carberry, 2000). These researchers used a plan-based approach to identify erroneous beliefs that account for a user's statements or plan, while we use a probabilistic approach. Our approach supports the consideration of many possible options, and integrates suppositions into a broader reasoning context.</Paragraph> <Paragraph position="6"> Finally, the research reported in (Joshi et al., 1984; van Beek, 1987; Zukerman and McConachy, 2001) considers the addition of information to planned discourse to prevent a user's erroneous inferences from this discourse. Our mechanism adds explanatory extensions to an interpretation to prevent inferences that are objectionable due to discontinuities in belief. Since such non sequiturs may also be present in system-generated arguments, the approach presented here may be incorporated into argument-generation systems.</Paragraph> </Section> </Paper>