XML Viewer - w06-0406

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/06/w06-0406_metho.xml
Size: 33,269 bytes
Last Modified: 2025-10-06 14:10:33
<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-0406">
  <Title>Capturing Disjunction in Lexicalization with Extensible Dependency Grammar</Title>
  <Section position="4" start_page="41" end_page="42" type="metho">
    <SectionTitle>
2 Extensible Dependency Grammar
</SectionTitle>
    <Paragraph position="0"> An informal overview of XDG's core concepts is in order; for a formal description of XDG, however, see (Debusmann and Smolka, 2006; Debusmann et al., 2005). Strictly speaking, XDG is not a grammatical framework, but rather a description language over finite labelled multigraphs that happens to show very convenient properties for the modeling of natural language, among which a remarkable reconciliation between monostratality, on one side, and modularity and extensibility, on the other.</Paragraph>
    <Paragraph position="1"> Most of XDG's strengths stem from its multi-dimensional metaphor (see Fig. 1), whereby an (holistic or multidimensional) XDG analysis consists of a set of concurrent, synchronized, complementary, mutually constraining one-dimensional analyses, each of which is itself a graph sharing the same set of nodes as the other analyses, but having its own type or dimension, i.e., its own edge label and lexical feature types and its own well-formedness constraints. In other words, each 1D analysis has a nature and interpretation of its own, associates each node with one respective instance of a data type of its own (lexical features) and establishes its own relations/edges between nodes using labels and principles of its own.</Paragraph>
    <Paragraph position="2"> That might sound rather autistic at first, but the 1D components of an XDG analysis interact in fact. It is exactly their sharing one same set of nodes, whose sole intrinsic property is identity, that provides the substratum for interdimensional communication, or rather, mutual constraining.</Paragraph>
    <Paragraph position="3"> html  laugh.&amp;quot; according to grammar Chorus.ul deployed with the XDK  That is chiefly achieved by means of two devices, namely: multidimensional principles and lexical synchronization.</Paragraph>
    <Paragraph position="4"> Multidimensional principles. Principles are reusable, usually parametric constraint predicates used to define grammars and their dimensions.</Paragraph>
    <Paragraph position="5"> Those posing constraints between two or more 1D analyses are said multidimensional. For example, the XDK library provides a host of linking principles, one of whose main applications is to regulate the relationship between semantic arguments and syntactic roles according to lexical specifications. The framework allows lexical entries to contain features of the type lab(D1) - {lab(D2)}, i.e. mappings from edge labels in dimension D1 to sets of edge labels in D2. Therefore, lexical entries specifying {pat -{subj}} might be characteristic of unaccusative verbs, while those with {agt -{subj}, pat -{obj}} would suit a class of transitive ones. Linking principles pose constraints taking this kind of features into account. null Lexical synchronization. The lexicon component in XDG is specified in two steps: first, each dimension declares its own lexicon entry type; next, once all dimensions have been declared, lexicon entries are provided, each specifying the values for features on all dimensions. Finally, at run-time it is required of well-formed analyses that there should be at least one valid assignment of lexicon entries to nodes such that all principles are satisfied. In other words, every node must be assigned a lexicon entry that simultaneously satisfies all principles on all dimensions, for which reason the lexicon is said to synchronize all 1D components of an XDG analysis. Lexical synchronization is a major source of propagation.</Paragraph>
    <Paragraph position="6"> Figure 2 presents a sample 5D XDG analysis involving the most standard dimensions in XDG practice and jargon, namely (i) PA, capturing predicate argument structure; (ii) SC, capturing the scopes of quantifiers; (iii) DS, for deep syntax, i.e. syntactic structure modulo control and raising phenomena; (iv) ID, for immediate dominance in surface syntax (as opposed to DS); and (v) LP, for linear precedence, i.e. a structure tightly related to ID working as a substratum for constraints on the order of utterance of words. In fact, among these dimensions LP is the only one actually to involve a concept of order. PA and DS, in turn, are the only ones not constrained to be trees, but directed acyclic graphs instead. Further details on the meaning of all these dimensions, as well as the interactions between them, would be beyond the scope of this paper and have been dealt with elsewhere. From Section 3 on we shall focus on PA and, to a lesser extent, the dimension with which it interfaces directly: DS.</Paragraph>
    <Paragraph position="7"> Emulating deletion. Figure 2 also illustrates the rather widespread technique of deletion, there applied to infinitival &amp;quot;to&amp;quot; on dimensions DS, PA, and SC. As XDG is an eminently monostratal and thus non-transformational framework, &amp;quot;deletion&amp;quot; herein refers to an emulation thereof. According to this technique, whenever a node has but one incoming edge with a reserved label, say del, on dimension D it is considered as virtually deleted on D. In addition, one artificial root node is postulated from which emerge as many del edges as required on all dimensions. The trick also comes in handy when tackling, for instance, multiword expressions (Debusmann, 2004), which involve worthy syntactic nodes that conceptually have no semantic counterparts.</Paragraph>
  </Section>
  <Section position="5" start_page="42" end_page="48" type="metho">
    <SectionTitle>
3 Modelling Lexicalization Disjunction
</SectionTitle>
    <Paragraph position="0"> in XDG Generation input. Having revised the basics of XDG, it is worth mentioning that so far it has been used mostly for parsing, in which case the input type is usually rather straightforward, namely typewritten sentences or possibly other text units. Model creation is also very simple in parsing and consists of (i) creating exactly one node for each input token, all nodes being instances of one single homogeneous feature structure type automatically inferred from the grammar definition, (ii) making each node select from all the lexical entries indexed by its respective token, (iii) posing constraints automatically generated from the principles found in the grammar definition and (iv) deterministically assigning values to the orderrelated variables in nodes so as to reflect the actual order of tokens in input.</Paragraph>
    <Paragraph position="1"> As concerns generation, things are not so clear, though. For a start, take input, which usually varies across applications and systems, not to mention the fact that representability and computability of meaning in general are open issues. Model creation should follow closely, as it is a direct function of input. Notwithstanding, we can  to some extent and advantage tell what generation input is not. Under the hypothesis of an XDG-based generation system tackling lexicalization, input is not likely to contain some direct representation of fully specified PA analyses, much though this is usually regarded as a satisfactory output for a parsing system (!). What happens here is that generating an input PA analysis would presuppose lexicalization having already been carried out. In other words, PA analyses accounting for e.g. &amp;quot;a ballerina&amp;quot; and &amp;quot;a dancing female human being&amp;quot; have absolutely nothing to do with each other whereas what we wish is exactly to feed input allowing both realizations. Therefore, PA analyses are themselves part of generation output and are acceptable as parsing output inasmuch as &amp;quot;de-lexicalization&amp;quot; is considered a trivial task, which is not necessarily true, however.</Paragraph>
    <Paragraph position="2"> Although our system still lacks a comprehensive specification of input format and semantics, we have already established on the basis of the above rationale that our original PA predicates must be decomposed into simpler, primitive predicates that expose their inter-relations. For the purpose of the present discussion, we understand that it suffices to specify that our input will contain flat first-order logic-like conjunctions such as [?]x(dance(x)[?]female(x)[?]human(x)), in order to characterize entities, even if the final accepted language is sure to have a stricter logic component than first-order logic and might involve crossings with yet other formalisms. Predicates, fortunately, are not necessarily unary; and, for example, &amp;quot;A ballerina tapped a lovely she-dog&amp;quot; might well be generated from the following input:</Paragraph>
    <Paragraph position="4"> Deletion as the substance of disjunction. Naturally, simply creating one node for each input semantic literal is not at all the idea behind our model. For example, if &amp;quot;woman&amp;quot; is to be actually employed in a specific lexicalization task, then it should continue figuring as one single node in XDG analyses as usual in spite of potentially covering a complex of literals. In fact, XDG and, in specific, PA analyses should behave and resemble much the same as they used to.</Paragraph>
    <Paragraph position="5"> However, one remarkable difference of analyses in our generation model as compared to parsing lies in the role and scope of deletion, which indeed constitutes the very substance of disjunction now. By assigning all nodes but the root one extra lexical entry synchronizing deletion on all dimensions2, we build an unrestrained form of disjunction whereby whole sets of nodes may as well act as if not taking part in the solution. Now it is possible to create nodes at will, even one for each applicable lexical item, and rely on the fact that, many ill-formed outputs as the set of all solutions may contain, it still covers all correct paraphrases, i.e. those in which all and only the right nodes have been deleted. For example, should one node be created for each of &amp;quot;ballerina&amp;quot;, &amp;quot;woman&amp;quot;, &amp;quot;dancer&amp;quot;, &amp;quot;dancing&amp;quot;, &amp;quot;female&amp;quot; and &amp;quot;person&amp;quot;, all possible combinations of these words, including the correct ones, are sure to be generated.</Paragraph>
    <Paragraph position="6"> Our design obviously needs further constraining, yet the general picture should be visible by now that we really intend to finish model creation -- or rather, start search -- with (i) a bunch of perfectly floating nodes in that not one edge is given at this time, all of which are equally willing and often going to be deleted, and (ii) a bunch of constraints to rule out ill-formed output and provide for efficiency. There are two main gaps in this summary, namely: * what these constraints are and * how exactly nodes are to be created.</Paragraph>
    <Paragraph position="7"> This paper restricts itself to the first question. The second one involves issues beyond lexicalization, actually permeating all generation tasks, and is currently our research priority. Consequently, in all our experiments most of model creation was handcrafted.</Paragraph>
    <Paragraph position="8"> In the name of clarity, we shall hereafter abstract over deletion, that is to say we shall in all respects adhere to the illusion of deletion, that nodes may cease to exist. In specific, whenever we refer to the sisters, daughters and mothers of a node, we mean those not due to deletion. In other words, all happens as if deleted nodes had no relation whatsoever to any other node. This abstraction is extremely helpful and is actually employed in our implementation, as shown in Section 4.</Paragraph>
    <Section position="1" start_page="44" end_page="48" type="sub_section">
      <SectionTitle>
3.1 How Nodes Relate
</SectionTitle>
      <Paragraph position="0"> In the following description, we shall mostly restrict ourselves to what is novel in our model as compared to current practice in XDG modelling.</Paragraph>
      <Paragraph position="1"> Therefore, we shall emphasize dimension PA and the new constraints we had to introduce in order to have only the desired PA analyses emerge. Except for sparse remarks on dimension DS and its relationship with PA, which we shall also discuss briefly, we assume without further mention the concurrence of other XDG dimensions, principles and concepts (e.g. lexical synchronization) in any actual application of our model.</Paragraph>
      <Paragraph position="2"> Referents, arguments and so nodes meet. For the most part, ruling out ill-formed output concerns posing constraints on acceptable edges, especially when one takes into account that all we have is some floating nodes to start with. Let us first recall that dimension PA is all about predicate arguments, which are necessarily variables thanks to the flat nature of our input semantics. Roughly speaking, each PA edge relates a predicate with one of its arguments and thus &amp;quot;is about&amp;quot; one single variable. Therefore, our first concern must be to ensure that every PA edge should land on a node that &amp;quot;is (also) about&amp;quot; the same variable as the edge itself.</Paragraph>
      <Paragraph position="3"> In order to provide for such an &amp;quot;aboutness&amp;quot; agreement, so to speak, one must first provide for &amp;quot;aboutness&amp;quot; itself. Thus, we postulate that every node should now have two new features, namely (i) hook, identifying the referent of the node, i.e. the variable it is about, and (ii) holes, mapping every PA edge label lscript into the argument (a variable) every possible lscript-labelled outgoing edge should be about. Normally these features should be lexicalized. The coincidence with Copestake et al.'s terminology (Copestake et al., 2001) is not casual; in fact, our formulation can be regarded as a decoupled fragment of theirs, since neither our holes involves syntactic labels nor are scopal issues ever touched. As usual in XDG, we leave it for other modules such as mentioned in the previous section to take charge of scope and the relationship between semantic arguments and syntactic roles. The role of these new features is depicted in Figure 3, in which an arrow does not mean an edge but the possibility of establishing edges.</Paragraph>
      <Paragraph position="4"> Completeness and compositionality. Next we proceed to ensure completeness, i.e. that every so-Figure 3: For every node v and on top of e.g. valency constraints, features hook and holes further constrain the set of nodes able to receive edges from v for each specific edge label.</Paragraph>
      <Paragraph position="5"> lution should convey the whole intended semantic content. To this end, nodes must have features holding semantic information, the most basic of which is bsem, standing for base semantic content, or rather, the semantic contribution a lexical entry may make on its own to the whole.</Paragraph>
      <Paragraph position="6"> For example, &amp;quot;woman&amp;quot; might be said to contribute lx.female(x)[?]human(x), while &amp;quot;female&amp;quot;, only lx. female(x). Normally bsem should be lexicalized. null In addition, we postulate feature sem for holding the actual semantic content of nodes, which should not be lexicalized, but rather calculated by a principle imposing semantic compositionality. In our rather straightforward formulation, for every node v, sem(v) is but the conjunction of bsem(v) and the sems of all its PA daughters thus:</Paragraph>
      <Paragraph position="8"> (2) where v lscript[?]-D u denotes that node u is a daughter of v on dimension D through an edge labelled lscript (the absence of the label just denotes that it does not matter).</Paragraph>
      <Paragraph position="9"> Finally, completeness is imposed by means of node feature axiom, upon which holds the invari-</Paragraph>
      <Paragraph position="11"> for every node v. The idea is to have axiom as a lexicalized feature and consistently assign it the neutralizing constant true for all lexical entries but those meant for the root node, in which case the value should equal the intended semantic content. null Coreference classes, concentrators and revisions to dimensions PA and DS. The unavoid- null able impediment to propagation is intrinsic choice, i.e. that between things equivalent and that we wish to remain so. That is exactly what we would like to capture for lexicalization while attempting to make the greatest amount of determinacy available to minimize failure. To this end, our strategy is to make PA analyses as flat as possible, with coreferent nodes -- i.e. having the same referent or hook -- organizing in plexuses around, or rather, directly below hopefully one single node per plexus, thus said to be a concentrator. This offers advantages such as the following:  1. the number of leaf nodes is maximized, whose sem features are determinate and equals their respective bsems; 2. coreferent nodes tend to be either potential sisters below a concentrator or deleted. This  allows most constraints to be stated in terms of direct relationships of mother-, daughteror sisterhood. Such proximity and concentration is rather opportune because we are dealing simply with potential relationships as nodes will usually be deleted. In other words, our constraints aim mostly at ruling out undesired relations rather than establishing correct ones. The latter must remain a matter of choice.</Paragraph>
      <Paragraph position="12"> It is in order to define which are the best candidates for concentrators. Having different concentrators in equivalent alternative realizations, such as &amp;quot;a ballerina&amp;quot;, &amp;quot;a female dancer&amp;quot; or &amp;quot;a dancing woman&amp;quot; (hypothetical concentrators are underlined), would be rather hampering, since the task of assigning &amp;quot;concentratingness&amp;quot; would then be fatally bound to lexicalization disjunction itself and not much determinacy could possibly be derived ahead of committing to this or that realization. In face of that, the natural candidate must be something that remains constant all along, namely the article. Certainly, what specific article and, among others, whether to generate a definite/anaphoric or indefinite/first-time referring expression is also a matter of choice, but not pertaining to lexicalization. For the sake of simplicity and scope, let us stick to the case of indefinite articles, keeping in mind that possible extensions to our model to cope with (especially definite anaphoric) referring expression generation shall certainly re- null Electing articles for concentrators means that they now directly dominate their respective nouns and accompanying modifiers on dimension PA as shown in Figure 4 for &amp;quot;a dancing female person&amp;quot;. One new edge label apply is postulated to connect concentrators with their complements, the following invariants holding:  1. for every node v, hook(v) = holes(v)(apply), i.e. only coreferent nodes are linked by apply edges; 2. every concentrator lexical entry provides a  valency allowing any number of outgoing apply edges, though requiring at least one.</Paragraph>
      <Paragraph position="13"> Roughly speaking, the intuition behind this new PA design is that the occurrence of a lexical (as opposed to grammatical) word corresponds to the evaluation of a lambda expression, resulting in a fresh unary predicate built from the basesem of the word/node and the sems of its children.</Paragraph>
      <Paragraph position="14"> In turn, every apply edge denotes the application of one such predicate to the variable/referent of a concentrator. In fact, even verbs might be treated analogously if Infl constituents were modelled, constituting the concentrators of verb base forms. Also useful is the intuition that PA abstracts over most morphosyntactic oppositions, such as that between nouns and adjectives, which figure as equals there. The subordination of the latter word class to the former becomes a strictly syntactic phenomenon or, in any case, other dimensions' affairs.</Paragraph>
      <Paragraph position="15"> Dimension DS is all about such oppositions, however, and should remain much the same except that the design is rather simplified if DS maintains concentrator dominance. As a result, articles must stand as heads of noun -- or rather, de- null ing nodes in the lexicalization of &amp;quot;a ballerina&amp;quot; and its paraphrases terminer -- phrases, which is not an unheard-of approach, just unprecedented in XDG. Naturally, standard syntactic structures should appear below determiners, as exemplified in Figure 4. Granted this, the flatness of PA and its relation to DS can straightforwardly be accomplished by the application of XDK library principles Climbing, whereby PA is constrained to be a flattening of DS, and Barriers, whereby concentrators are set as obstacles to climbing by means of special lexical features. Figure 5 thus illustrates the starting conditions for the lexicalization of &amp;quot;a ballerina&amp;quot; and its paraphrases, including the bsems of nodes. Notice that we have created distinct nodes for different parts of speech of one same word, &amp;quot;female&amp;quot;. The relevance of this measure shall be clarified along this section as we develop this example.</Paragraph>
      <Paragraph position="16"> Fighting over-redundancy. We currently employ two constraints to avoid over-redundancy.</Paragraph>
      <Paragraph position="17"> The first is complete in that its declarative semantics already sums up all we desire to express in that matter, while the other is redundant, incomplete, but supplied to improve propagation.</Paragraph>
      <Paragraph position="18"> The complete constraint is imposed between every node and each of its potential daughters.</Paragraph>
      <Paragraph position="19"> Apart from overhead reasons, it might as well be imposed between every pair of nodes. However, the set of potential daughters of a node v is best approximated by function dcands thus:</Paragraph>
      <Paragraph position="21"> where &lt;x&gt; denotes the coreference class of variable x; and ran(f), the range of function f. It is worth noticing that in generation dcands is known at model creation.</Paragraph>
      <Paragraph position="22"> Given a node u and a potential daughter v [?] dcands(u), this constraint involves hypothesizing what the actual semantic content of u would be like if v were not among its daughters.</Paragraph>
      <Paragraph position="23"> Let hdsv(u) and hsemv(u) be respectively the hypothetical set of daughters of u counting v out and its &amp;quot;actual&amp;quot; semantic content in that case, which can be defined thus:</Paragraph>
      <Paragraph position="25"> The constraint consists of ensuring that, if the actual semantic content of the potential daughter v would be subsumed by the hypothetical semantic content of u, then v can never be a daughter of u.</Paragraph>
      <Paragraph position="26"> In other words, each daughter of u must make a difference. Formally, we have the following:</Paragraph>
      <Paragraph position="28"> where the two implication symbols, = and have the same interpretation in this logic statement, but are nonetheless distinguished because their implementations are radically different as shall be discussed in Section 4. Constraint (5) is especially active after some choices have been made. Suppose, in our &amp;quot;a ballerina&amp;quot; example, that &amp;quot;dancing&amp;quot; is the only word selected so far for lexicalization. Let u and v be respectively the nodes for &amp;quot;a&amp;quot; and &amp;quot;dancing&amp;quot;. In this case, the consequent in (5) is false and so must be the antecedent hsemv(u) = dance(x), which implies that hsemv(u) can never &amp;quot;contain&amp;quot; the literal dance(x). From (4) and the fact that articles have neutral base semantics -- i.e. bsem(u) = true -it follows that all further daughters of u must not imply dance(x). As that does not hold for &amp;quot;ballerina&amp;quot; and &amp;quot;dancer&amp;quot;, these nodes are ruled out as daughters of u and thus deleted for lack of mothers. Conversely, if &amp;quot;ballerina&amp;quot; had been selected  in the first place, (5) would trivially detect the redundancy of all other words and analogously entail their deletion.</Paragraph>
      <Paragraph position="29"> In turn, the redundant constraint ensures that, for every pair of coreferent nodes u and v [?] &lt;upvar(u)&gt; , if the actual semantic content of v is subsumed by u, then they can never be sisters.</Paragraph>
      <Paragraph position="30"> Formally:</Paragraph>
      <Paragraph position="32"> This constraint is remarkable for being active even in the absence of choice since it is established between potential sisters, which usually have their sems sufficiently, if not completely, determined.</Paragraph>
      <Paragraph position="33"> Surprisingly enough, the main effect of (6) is on syntax, by constraining alliances on DS. As our new version of the XDK's Climbing principle is now aware of sisterhood constraints, it will constrain every node on PA to have as a mother on DS either its current PA mother or some node belonging to one of its PA sister trees3. In ground terms, when (6) detects that e.g. &amp;quot;woman&amp;quot; subsumes &amp;quot;female (adj./n.)&amp;quot; and constrains them not to be sisters on PA, the Climbing principle will rule out &amp;quot;woman&amp;quot; as a potential DS mother of &amp;quot;female (adj.)&amp;quot;. It is worth mentioning that once v /[?] sistersD(u) is imposed, our sisterhood constraints entail u /[?] sistersD(v).</Paragraph>
      <Paragraph position="34"> Redundant compositionality constraints. Although a complete statement of semantic compositionality is given by Equation 2, we introduce two redundant constraints to improve propagation.</Paragraph>
      <Paragraph position="35"> The first of them attempts to advance detection of nodes whose semantic contribution is strictly required even before the sem features of their mothers become sufficiently constrained. It does so by means of an strategy analogous to that of (5), namely by hypothesizing, for every node v, what the total semantic content would be like if v were deleted. Let root, hdownv(u) and htotsemv be respectively the root node, the set of nodes directly or indirectly below u counting v out, and the total semantic content supposing v is deleted, which can be defined thus:</Paragraph>
      <Paragraph position="37"> The constraint can be formally expressed thus:</Paragraph>
      <Paragraph position="39"> Unfortunately, (7) is not of much use in our current example, better applying to cases where there are a greater number of alternative inner nodes. For example, in the lexicalization of (1), this constraint was immediately able to infer that &amp;quot;lovely&amp;quot; must not be deleted since it was the sole node contributing lovely(y).</Paragraph>
      <Paragraph position="40"> The second redundant compositionality constraint attempts to advance detection of nodes not counting on enough potential sisters to fulfill the actual semantic content of their mothers. To this end, for every node v, the following constraint is imposed:</Paragraph>
      <Paragraph position="42"> braceleftbigg [?], iff v is deleted on D sistersD(v)[?]{v}, else. (9) which reads &amp;quot;the actual semantic content of the mothers of a node is equal to their base semantic content in conjunction with the actual semantic content of this node and its sisters&amp;quot;. It is worth noticing that, when v is deleted, both {u : u[?]-PA v} and eqsisPA(v) become empty so that (8) still holds. This constraint is especially interesting because our new versions of principles Climbing and Barriers, which hold between DS and PA, propagate sisters constraints in both directions. In association with (6) and (8), these principles promote an interesting interplay between syntax and semantics. Resuming our example, let v be node &amp;quot;female (n.)&amp;quot;. Before any selection is performed, constraint (6) infers that only &amp;quot;dancing&amp;quot;, &amp;quot;person&amp;quot; and &amp;quot;dancer&amp;quot; can be sisters to v on PA and thus (now due to Climbing) daughters to v on DS. They cannot be mothers to v because its valency on DS and Climbing are enough to establish that, if v has any mother at all on DS, it is &amp;quot;a&amp;quot;. Again taking the DS valency of v into account, it is possible to infer that, if v has any daughter at all on DS, it is &amp;quot;dancing&amp;quot;, i.e. the only adjective in the original set of candidate  daughters. It is the new sisterhood-aware version of Barriers that propagates this new piece of information back to PA. This principle now knows that the sisters of v on PA must come from either (i) the tree below v on DS, (ii) one of its DS sister trees or (iii) some DS tree whose root belongs to eqsisDS(inter) for some node inter appearing -- on DS -- between v and one of its mothers on PA. In our example, (ii) and (iii) are known to be empty sets, while (i) is at most &amp;quot;dancing&amp;quot;. Consequently, &amp;quot;dancing&amp;quot; is the only potential PA sister of v. Now (8) is finally able to contribute. As &amp;quot;a&amp;quot; is the only possible DS mother of v and any article has empty basic semantics, one is entitled to equate logicalandtext{sem(u) : u[?]-PA v} tologicalandtext {sem(u) : u [?] eqsisPA(v)}. Even though it is not known whether v will ever have mothers or daughters, (8) knows that the left-hand side of the equation yields either the whole intended semantics or nothing, while the right-hand side yields either nothing or at most dance(x) [?] female(x).</Paragraph>
      <Paragraph position="43"> Therefore, the only solution to the equation is nothing on both sides, implying that eqsisPA(v) is empty and thus v is deleted by definition (9).</Paragraph>
      <Paragraph position="44"> Such strong interplay is only possible because we have created distinct nodes for the different parts of speech -- or rather, the two different DS valencies -- of &amp;quot;female&amp;quot;. With somewhat more complicated, heavier constraints it would be possible to have the same propagation for one single node selecting from different parts of speech.</Paragraph>
      <Paragraph position="45"> Notwithstanding, that does not seem worth the effort because a model creation algorithm would be perfectly able to detect the diverging DS valencies, create as many nodes as needed and distribute the right lexical entries among them.</Paragraph>
    </Section>
  </Section>
  <Section position="6" start_page="48" end_page="48" type="metho">
    <SectionTitle>
4 Implementation and Performance
Remarks
</SectionTitle>
    <Paragraph position="0"> The ideas presented in Section 3 were fully implemented in a development branch of the XDK.</Paragraph>
    <Paragraph position="1"> As with the original XDK, all development is based on the multiparadigm programming system Mozart4.</Paragraph>
    <Paragraph position="2"> The implementation closely follows the original CP approach of the XDK and strongly reflects the constraints we have presented after some rather standard transformations to CP, namely:  human(x) and tap(e,x,y), are encoded as integer values. Features bsem/sem are implemented as set constants/variables of such integers; * logic conjunction [?] is thus modelled by set union [?]. Each &amp;quot;big&amp;quot; conjunction is reduced to the form logicalandtext{f(v) : v [?] V}, where V is a set variable of integer-encoded node identifiers, and modelled by a union selection constraint uniontext&lt;f(1) f(2) ...f(M)&gt; [V], where M is the maximum node identifier and which constrains its result -- a set variable -- to be the union of f(v) for all v [?] V ; * implications of the form x = y are implemented as y [?] x, while those of the form x - y as reify(x) [?] reify(y), where the result of reify(x) is an integer-encoded boolean variable constrained to coincide with the truth-value of expression x.</Paragraph>
    <Paragraph position="3"> Our branch of the XDK now counts on two new principles, namely (i) Delete, which requires the Graph principle, creates doubles for the node attributes introduced by the latter, providing the illusion of deletion, and introduces features for sisterhood constraints; and (ii) Compsem, imposing all constraints described in Section 3.</Paragraph>
    <Paragraph position="4"> A few preliminary proof-of-concept experiments were carried out with input similar to (1) and linguistically and combinatorially analogous to our &amp;quot;ballerina&amp;quot; example. In all of them, the system was able to generate all paraphrases with no failed state (backtracking) in search, which means that propagation was maximal for all cases. Although our design supports more complex linguistic constructs such as relative clauses and preposition phrases and is expected to behave similarly for those cases, we have not made any such experiments so far. This is so because we are currently prioritizing the issue of model creation and coverage of other generation tasks.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML