File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/95/p95-1009_metho.xml

Size: 22,718 bytes

Last Modified: 2025-10-06 14:14:00

<?xml version="1.0" standalone="yes"?>
<Paper uid="P95-1009">
  <Title>User-Defined Nonmonotonicity in Unification-Based Formalisms</Title>
  <Section position="4" start_page="63" end_page="64" type="metho">
    <SectionTitle>
3 User-Defined Nonmonotonicity
</SectionTitle>
    <Paragraph position="0"> I will now describe how the work by Young and Rounds can be generalized to allow the user to define nonmonotonic constructions. The main idea in their approach is that each node in a feature structure consists of a nonmonotonic sort. Such sorts can contain two different kinds of information, the ordinary monotonic information and a set of defaults. If we assume that fl is defined as a default in Young and Rounds' work then it is interpreted according to the rule: if it is consistent to believe # then believe #. In Reiter's default logic this is expressed with the following normal default rule.</Paragraph>
    <Paragraph position="2"> In this paper I want to allow the user to use other forms of nonmonotonic inferences and not only the normal default rule given above. Therefore, I will consider the general form of default rules. An intuitive reading of a general default rule is, if a is believed and it is consistent to believe # then believe 7. In default logic this is usually expressed as  The next question is how such defined nonmonotonic rules are going to be interpreted in a unification framework. In (Reiter, 1980), a rule like the one above could be applied whenever a is true and # is consistent with the information we already have.</Paragraph>
    <Paragraph position="3"> If we assume that V represents the information already given this means that the default rule can be applied whenever Y C a and Y I-I # does not yield unification failure. When the rule is applied the new information obtained would be 1/Iq 7.</Paragraph>
    <Paragraph position="4"> In the approach described in this paper, the user is allowed to define the actual nonmonotonic rule that should be used for a particular operation by using the following syntax.</Paragraph>
    <Paragraph position="6"> In the syntax given above name assigns a name to the defined rule, and thus allows the user to use nonmonotonic information when defining linguistic knowledge. The parameters in the rule definition are variables, which can be used within the actual default rule at the end of the description. The user is assumed to assign the nonmonotonic information contained in this rule to his linguistic knowledge by using an expression of the form narne(pararneterl , . . . parametern ).</Paragraph>
    <Paragraph position="7"> The when slot in the rule allows the user to decide when the rule is going to be applied, or in Young and Rounds' terminology, explained. I will make use of two values for the when-slot, immediate and posterior. Immediate means that the nonmonotonic rule is going to be applied each time a full unification task has been solved or whenever all information about an object in the defined inheritance hierarchy has been retrieved. Posterior explanation means that the explanation of the rule is postponed until reaching the result of some external process, for example, a parser or generator. There is however no hinder in excluding the use of other values here. One could, for example, imagine cases where one would want different nonmonotonic rules to be explained after a completed parse, a generation, or after resolving discourse referents.</Paragraph>
    <Paragraph position="8"> Note that although the when slot in the definition of a nonmonotonic rule allows the user to define when his rule is going to be applied we will still have an order independent nonmonotonic unification operator. This is the case because we follow Young and Rounds' approach and separate the unification operation from the explanation of a nonmonotonic rule.</Paragraph>
    <Paragraph position="9"> Therefore, what affects the final result of a computation is when one chooses to explain default rules and not the order of the unification operations occurring between such explanations.</Paragraph>
  </Section>
  <Section position="5" start_page="64" end_page="65" type="metho">
    <SectionTitle>
4 Formal Definitions
</SectionTitle>
    <Paragraph position="0"> In this section I provide give the formal definitions for nonmonotonic sorts and how nonmonotonic sorts are unified and explained. The definitions are generalizations of the definitions in Young and Rounds (1993). The notation a -,~ b is used to denote the fact that a I-1 b does not yield unification failure.</Paragraph>
    <Paragraph position="1"> A nonmonotonic sort is a structure containing both information from the basic subsumption order and information about default rules to be explained at a later point in the computation.</Paragraph>
    <Paragraph position="2"> Definition 1 A nonmonotonic sort is a pair (s, A) where s E S and A is a set of nonmonotonic rules of the form (w, a : fl ==~ 3') where w is an atom and a, fl and 3' E S. It is assumed that for each nonmonotonic rule 3' _C fl, a --, s, fl ,~ s, and 713s C s.</Paragraph>
    <Paragraph position="3"> As seen by the definition a nonmonotonic sort is considered to be a pair of monotonic information from the subsumption order and nonmonotonic information represented as a set of nonmonotonic rules, The user can assign nonmonotonic information to a nonmonotonic sort by calling a nonmonotonic definition as defined in the previous section. The actual nonmonotonic rule occurring within the sort is a pair consisting of the when slot and the last part of the nonmonotonic definition, with the parameter variables instantiated according to the call made by the user.</Paragraph>
    <Paragraph position="4"> The second part of this definition contains some well-foundedness conditions for a nonmonotonic sort. The first condition (3' _C ~) is a restriction similar to the restriction to normal default rules in Reiter's (1980) default logic. This restriction ensures that the application of one default rule will never cause previously applied default rules to be inapplicable. This makes the procedure for application of defaults more efficient and will be further discussed in section 6.</Paragraph>
    <Paragraph position="5"> The next two conditions in the definition, a ,-, s and fl ~ s, guarantee that the default rule is or can be applicable to the nonmonotonic sort. The reason for only checking that a ~ s instead of s C a is that future unifications can restrict the value of s into something more specific than a and thus may make the default rule applicable.</Paragraph>
    <Paragraph position="6"> The last condition on a nonmonotonic sort, 3'Us rs, may seem superfluous. The reason for including it is to ensure that applying the default actually restricts the value of the sort. Otherwise the default rule would have no effect and can be removed. Note in particular that the above conditions on a nonmonotonic sort implies that 7 may be fail.</Paragraph>
    <Paragraph position="7"> Given the unification operation of objects within the subsumption order and the definition of nonmonotonic sorts it is possible to define an operation for nonmonotonic unification.</Paragraph>
    <Paragraph position="8"> Definition 2 The nonmonotonic unification (n~v) of two nonmonotonic sorts (sl, A1) and (s2, A2) is the sort (s, A) where</Paragraph>
    <Paragraph position="10"> A2, a.,~s, ~,..s, andTtqst-s} The nonmonotonic unification is computed by computing the unification of the monotonic parts of the two sorts and then taking the union of their non-monotonic parts. The extra conditions used when forming the union of the nonmonotonic parts of the sorts are the same as in the definition of a nonmonotonic sort and their purpose is to remove nonmonotonic rules that are no longer applicable, or would have no effect when applied to the sort resulting from the unification.</Paragraph>
    <Paragraph position="11"> It is important to note that this generalization of the original definition of nonmonotonic unification from Young and Rounds (1993) preserves the property of order independence for default unification. When using nonmonotonie sorts containing non-monotonic rules, we also need to know how to merge the monotonic and nonmonotonic information within the sort. I will use the terminology w-application for applying one nonmonotonic rule to the sort and w-ezplanation when applying all possible rules.</Paragraph>
    <Paragraph position="12"> Definition 3 The nonmonotonic rule</Paragraph>
    <Paragraph position="14"> The result of the w-application is 3' I'1 s Note that the w in w-application should be considered as a variable. This means that only nonmonotonic rules whose first component is w are considered and that it is possible to choose which nonmonotonic rules should be applied in a particular point at some computation.</Paragraph>
    <Paragraph position="15"> In addition note that the restriction that 7 --in all nonmonotonic rules and the special cases for s = fail ensures that the application of one non-monotonic rule never destroys the applicability of a previously applied rule. This reduces the amount of work required when computing a w-explanation.</Paragraph>
    <Paragraph position="16"> Based on these observations, a sufficient condition for w-explanation is defined as follows.</Paragraph>
    <Paragraph position="17"> Definition 4 t is a w-ezplanation of a nonmono- null tonic sort (s, A) if it can be computed in the following way: 1. If s = fail or no d E A is w-applicable then t = s else 2. Ch,,ose a d = (w, cr : fl =C/, 7) E A such that d is w-applicable to s.</Paragraph>
    <Paragraph position="18"> 3. Let s = sl'lT and go to 1.</Paragraph>
    <Paragraph position="19">  As shown by the definition, a w-explanation is computed by choosing one w-applicable default rule at a time and then applying it. Since the definition of w-applicability and the condition that 7 in all nonmonotonic rules ensures that whenever a nonmonotonic rule is applied it can never be inapplicable, there is no need to check if the preconditions of earlier applied nonmonotonic rules still hold. Note also that the choice of which nonmonotonic rule to apply in each step of a w-explanation is nondeterministic. Consequently, it is possible to have conflicting defaults and multiple w-explanations for a nonmonotonic sort.</Paragraph>
    <Paragraph position="20"> Note also that the result of a w-explanation is allowed to be fail. Another option would be to interpret .fail as if the application of the nonmonotonic rule should not be allowed. However, as seen in the next section, for many uses of nonmonotonic extensions within unification-based formalisms, the aim is to derive failure if the resulting structure does not fulfill some particular conditions. This makes it important to allow fail to be the result of applying a nonmonotonic rule.</Paragraph>
  </Section>
  <Section position="6" start_page="65" end_page="66" type="metho">
    <SectionTitle>
5 Examples
</SectionTitle>
    <Paragraph position="0"> In this section I will show how some of the most common nonmonotonic extensions to unification-based grammar can be expressed by defining rules as above. I will start with defining default values.</Paragraph>
    <Paragraph position="1"> This is done by defining a nonmonotonic rule default for the class value, which is assumed to be the most general class in a defined hierarchy. The rule defined here is treated as the one in (Young and Rounds, 1993).</Paragraph>
    <Paragraph position="2"> class value ; nonmon default(X) :immediate :X =&gt; X.</Paragraph>
    <Paragraph position="3"> This default rule can be used when defining verbs. The rule is used for stating that verbs are active by default. I also define the two Swedish verbs skickade (sent) and skickades (was sent) to illustrate how this  While retrieving the information for these two verbs we will obtain the following two feature structures containing nonmonotonic sorts: For skickade: \[lex: skicka, form: (\[I ,{(immediate, :active active )})3 For skickades: \[lex: skicka, form: (passive,{(immediate, :active ::~ active )})\] Since I have used immediate for the when-slot in the definition of the default rule, this nonmonotonic rule will be applied immediately after retrieving all information about a verb in the hierarchy. For the two structures above, the default rule can be applied for skickade, since active is consistent with D, but not for skickades, since active and passive are inconsistent. The result after applying immediateexplanation to the two structures above is shown below.</Paragraph>
    <Paragraph position="4"> For skickade: \[lex: skicka, form: active\] For skickades: \[lex: skicka, form: passive\] Another nonmonotonic operation that has been used in LFG (Kaplan and Bresnan, 1983) is the value constraint (=e) used to check whether a sub-structure has a particular value after a completed parse. The definition of value constraints as a non-monotonic rule makes use of negation, interpreted as negation as failure.</Paragraph>
    <Paragraph position="6"> One use of value constraints in LFG is to assert a condition that some grammar rules can only be used for passive sentences. I will here assume that a representation for verbs where passive verbs have the value passive for the attribute form, but where other verbs have no value for this attribute. In the syntax used in this paper the constrMnt that a particular grammar rule can only be used for passive verbs would be expressed as below: \[form: =c(passive)\] This would result in the nonmonotonic sort: \[form: (\[\],{(posterior, : ~passive fail )})3 As seen by the definition of =c, the explanation for this nonmonotonic sort is postponed and is assumed to be computed after finding a parse for some sentence. This implies that the only case where this rule would not apply, and thus not give fail as a result, is when the value of form actually is passive. For all other values of form, we would have something that is consistent with ~passive and thus the nonmonotonic rule will derive failure when applied.</Paragraph>
    <Paragraph position="7">  The next nonmonotonic structure I want to discuss is any-values. The inheritance hierarchy is used to be able to define any-values in a simple way.</Paragraph>
    <Paragraph position="8">  class value.</Paragraph>
    <Paragraph position="9"> class none; isa value.</Paragraph>
    <Paragraph position="10"> class any_value; isa value.</Paragraph>
    <Paragraph position="11"> nonmon any():posterior :any_no_value =&gt; fail.</Paragraph>
    <Paragraph position="12"> class any_no_value ; isa any_value.</Paragraph>
    <Paragraph position="13">  In this small hierarchy it is assumed that all possible values of a structure is a subtype of value. We then divide this into none, which represents that a structure cannot have any value and any_value which contains all actual values. The class any_value is then further divided into a class called any_no_value, which only contains this single value, and the actual values of a structure. The class any_no_value should not be used when defining linguistic knowledge. However, when applying the default rule a value that has not been instantiated is compatible with this any_no_value. Therefore the default rule can make the conclusion that the structure is inconsistent, which is what we desire. Note that, as soon as a value has been further instantiated into a 'real' value, it will no longer be consistent with any_no_value, and the nonmonotonic rule cannot apply. Two examples will further illustrate this. The nonmonotonic sort: (0, {( posterior, :any_no_value fail )}) will be posterior-explained to: fail While the sort: (\[lex: kalle\], {( posterior, :any_no_value ~ fail )}) will be posterior-explained to: \[lex : kalle\] The last nonmonotonic operations I want to discuss are completeness and coherence as used in LFG. To be able to define these operations I assume the inheritance hierarchy above, without the nonmonotonic definition of any. I will, instead, make use of the two nonmonotonic definitions below.</Paragraph>
    <Paragraph position="14"> class value; nonmon coherence(A) :immediate : \[A: none\] =&gt; \[A: none\]; nonmon completeness (A) :posterior :\[A: any_no_value\] =&gt; fail.</Paragraph>
    <Paragraph position="15"> The first of these rules is used to check coherence, and the effect is to add the value none to each attribute that has been defined to be relevant for coherence check, but has not been assigned a value in the lexicon. The second rule is used for checking completeness and it works similarly to the any-definition above.</Paragraph>
    <Paragraph position="16"> Finally, I will show how a fragment of a lexicon can be defined according to these rules. Note that in the definition of the transitive verb, the value any_value is given to the appropriate attributes. This means that they are inconsistent with none, and thus, the coherence rule cannot be applied.</Paragraph>
  </Section>
  <Section position="7" start_page="66" end_page="67" type="metho">
    <SectionTitle>
6 Relation to Default Logic
</SectionTitle>
    <Paragraph position="0"> In this section I will discuss the relation of this work to Reiter's (1980) default logic. There will also be some discussion on the computational properties and limitations of the given approach.</Paragraph>
    <Paragraph position="1"> Compared with Reiter's default logic, our notion of nonmonotonic sorts corresponds to default theories. Unification of nonmonotonic sorts would then correspond to merging two default theories into one single theory and our notion of explaining a nonmonotonic sort corresponds to computing the extension of a default theory in default logic.</Paragraph>
    <Paragraph position="2"> In default logic there is often a restriction to normal-default theories since non-normal default theories are not even semi-decidable. The restriction in our nonmonotonic rules that 7 C fl is similar to the restriction into normal default rules and captures the important property, that the application of one nonmonotonic rule should not affect the applicability of previously applied rules. The decidability of the nonmonotonic rules defined here is, however, highly dependant on the given subsumption order.</Paragraph>
    <Paragraph position="3"> In particular it is dependent on having a decidable unification operation and subsumption check.</Paragraph>
    <Paragraph position="4"> As mentioned previously there is with nonmonotonic sorts, as well as normal default logic, a possibility of conflicting defaults and thus multiple nonmonotonic extensions for a structure. One difference is that nonmonotonic sorts allow that the application of a nonmonotonic rule leads to fail, i.e. an inconsistent structure, while default logic does not allow this outcome. However, since fail is allowed as a valid explanation for a nonmonotonic sort, there is, as  for normal default logic, always at least one explanation for a sort.</Paragraph>
    <Paragraph position="5"> The two following examples will illustrate the difference between nonmonotonic rules giving multiple extensions and nonmonotonic rules giving a single explanation fail.</Paragraph>
    <Paragraph position="6">  In example a the application of one rule, does not make the other inapplicable. Thus the only explanation for a structure is achieved by applying both these two rules and results in fail. In example b, however, the application of one of the rules would block the application of the other. Thus, in this case there are two explanations for the structure dependant on which of the rules that has been applied first. Note that even though there is an order dependency on the application order of nonmonotonic rules this does not affect the order independency on nonmonotonic unification between application of nonmonotonic rules.</Paragraph>
    <Paragraph position="7"> Allowing multiple extensions gives a higher computational complexity than allowing only theories with one extension. Since it is the user who defines the actual nonmonotonic theory multiple extensions must be allowed and it must be considered a task for the user to define his theory in the way he prefers.</Paragraph>
  </Section>
  <Section position="8" start_page="67" end_page="67" type="metho">
    <SectionTitle>
7 Improvements of the Approach
</SectionTitle>
    <Paragraph position="0"> I will start with two observations regarding the definitions given in section 3. First, it is possible to generalize these definitions to allow the first component of a nonmonotonic sort to contain substructures that are also nonmonotonic sorts. With the generalized versions of the definitions explanations that simultaneously explain all substructures of a nonmonotonic sort will be considered. Note that the explanation of default rules at one substructure might affect the explanation of rules at other substructures. Therefore the order on which nonmonotonic rules at different substructures are applied is important and all possible application orders must be considered.</Paragraph>
    <Paragraph position="1"> Considering unification of nonmonotonic sorts it is not necessary to simplify the nonmonotonic part of the resulting sort. A = A i U A2 can be defined as an alternative to the given definition. This alternate definition is useful for applications where the simplification of nonmonotonic sorts by each unification is expected to be more expensive than the extra work needed to explain a sort whose nonmonotonic part is not simplified.</Paragraph>
    <Paragraph position="2"> As stated previously, nonmonotonic sorts allow multiple explanations of a nonmonotonic sort. If desired, it would be fairly easy to add priorities to the nonmonotonic rules, and thus induce a preference order on explanations.</Paragraph>
    <Paragraph position="3"> One further example will illustrate that it is also possible to define negation as failure with nonmonotonic rules. An intuitive interpretation of the defined rule below is that if X is believed (1/E X), failure should be derived.</Paragraph>
    <Paragraph position="4"> nonmon not(X) :immediate X =&gt; fail; However, if this definition is to be really useful we must also allow one definition of a nonmonotonic rule to make use of other nonmonotonic rules. In our original definition we said that the nonmonotonic rule above should be applied if Y ,~ --X. This can be generalized to the case where --X is a nonmonotonic rule if we extend the definition of -~ to also mean that the application (or explanation) of the not rule at this node does not yield failure.</Paragraph>
    <Paragraph position="5"> However, this generalization is outside default logic. Therefore, its computational properties are unclear and needs more investigation.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML