File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/98/w98-0903_metho.xml

Size: 13,262 bytes

Last Modified: 2025-10-06 14:15:09

<?xml version="1.0" standalone="yes"?>
<Paper uid="W98-0903">
  <Title>Three Formal Extensions to Primitive Optimality Theory</Title>
  <Section position="3" start_page="0" end_page="21" type="metho">
    <SectionTitle>
2 Existential Implication
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="0" end_page="21" type="sub_section">
      <SectionTitle>
2.1 Motivation
</SectionTitle>
      <Paragraph position="0"> OWP as described in Eisner (1997a) provides some support for correspondence constraints (input-output only). These may be defined by means of implication constraints of the form P --4 P or P --+ P, which can be interpreted as requiring, in the first case, that each surface constituent representing property P be aligned with an underlying constituent representing that property, and in the second case that every underlying constituent representing property P be aligned with a surface constituent representing that property. Constraints of this type may be employed to require correspondence between the underlying representation and the surface representation where corresponding constituents must be aligned with one another. However, natural languages also seem to follow weaker constraints requiring only that for each underlying constituent there be a corresponding surface constituent, regardless of the position of that constituent relative to its position in the underlying representation. For example, in Sanskrit roots with at least two voiced stops, where the root ends in a voiced aspirated stop, the underlying aspiration of the root-final stop can be realized upon that stop in the surface representation only when the root is followed by a suffix beginning with a vocoid or a nasal (data from Whitney (1889)):  /da,9a/ reach to \[dagfli~janti\] (Fut.) /bud~/ know, wake \[bod~i\] (Aor.) /dab~/ harm \[dab~ati\] (Pres.) Otherwise, the aspiration is realized on the preceding stop: /dag~/ reach to \[d~ak\] (root noun) /buda/ know, wake \[baut\] (root noun) /dab~/ harm \[dfiap\] (root noun) In these forms it is clear that aspiration is being preserved, but that it is surfacing in a position that cannot overlap with the underlying form. Another example is the Bantu language Chizigula (Kenstowicz and Kisseberth, 1988), in which roots with underlying high vowels appear on the surface with a single high tone in the penultimate syllable of the word, where this syllable could belong to a suffix. Additionally, if a prefix with an underlying high tone is prefixed to a root with no underlying high tone, the high tone of the prefix appears in the penultimate syllable of the resulting word. The existence of a high tone in the underlying form implies the existence of a high tone in the surface form, but the position where that high tone occurs in the underlying form has nothing to do with where the tone appears on the surface.</Paragraph>
    </Section>
    <Section position="2" start_page="21" end_page="21" type="sub_section">
      <SectionTitle>
2.2 Formalization
</SectionTitle>
      <Paragraph position="0"> Existential implication constraints can be used to drive correspondence effects such as the above. These constraints take the form &amp;quot;If X appears within domain D, then Y must appear within D as well.&amp;quot; Using the terms of OTP, this family of constraints can be written as a'l A...A am ~ /31 V ...V/3n/9,1 A ...A 9,p-Here each a'i or /3j is a constituent interior or edge, one of tier, \]tier, or \[tier, where tier represents a constituent type, and each 9,k must be a constituent interior (tier). The constraint represented by this notation outputs a violation for each domain 9,, where 9' represents the intersection of the domains 9,k, in which the time slice represented by the oq occurs, but no/3j occurs. Using the FST notation of Eisner (1997a), the implementation for this constraint would be the following FST:</Paragraph>
      <Paragraph position="2"> (in all 7k)),&amp;quot; N represents &amp;quot;((in all 7k) A -~(in all ai) A -~(in some/3j)),&amp;quot; B represents &amp;quot;((in all 9,k) A (in some/3j)),&amp;quot; A represents &amp;quot;((in all 7k) A  (in all cq) A -~(in some/3j)),&amp;quot; \[ represents &amp;quot;((in or begin all 9,k) - (in all 9,k)),&amp;quot; and \] represents &amp;quot;((in or end all 9,k) - (in all 9,k)).&amp;quot; That is, the machine moves from state S to state 1 if the domain 9, is entered. It moves from there back to state S if the end of the domain appears before cv does, or if any/3 appears. If a appears, the machine moves from state 1 to state 2. From state 2, if/3 appears, the machine returns to the start state without outputting a violation, but if the end of the domain appears without any/3 having appeared, the machine outputs a violation.</Paragraph>
    </Section>
  </Section>
  <Section position="4" start_page="21" end_page="22" type="metho">
    <SectionTitle>
3 Constraint Disjunction
</SectionTitle>
    <Paragraph position="0"> Crowhurst and Hewitt (1995) cite a number of instances in which it appears that multiple simple constraints must be combined via disjunction (there called conjunction) into complex constraints. Here a simple constraint is a function that takes an input, surface pair as its input and returns true if a particular disallowed phonological structure or lack of correspondence is present in the pair, otherwise false. A constraint disjunction would thus be a function that returns the disjunction of the outputs of its component constraints. Thus a constraint defined by disjunction of component constraints outputs a violation whenever any one of its components does.</Paragraph>
    <Paragraph position="1"> Formalization of constraint disjunction requires reference only to intersection of weighted finite state machines. Specifically, if constraint Cx is defined as a weighted finite state machine T1 = (~E1, ~2,Qi, F1,81, Ex), where E1 is  the alphabet of labels, E2 is the alphabet of weights, drawn from the natural numbers, Q1 is the set of states of the machine, F1 C Q1 is the final states, Sl is the start state, and E1 C Q1 x Y.q x Z,2 x Q1 is the set of edges, and constraint C2 is another weighted deterministic finite state machine T2 -- (~1, ~2, Q2, F2, s2, E2), then the disjunction of the two constraints may be defined as follows:</Paragraph>
    <Paragraph position="3"> A possible notation for the disjunction of two constraints C1 and C2 is C1 v C2, for example &amp;quot;(yce --+ vce) V (cont --+ cont)&amp;quot;.</Paragraph>
    <Paragraph position="4"> A similar concept is that of mutually unranked primitive constraints. For any given input, a complex constraint defined as a group of mutually unranked primitive constraints returns the sum of the violations that the primitive constraints returned. Although it has been argued that the formal power provided by allowing new constraints to be defined by grouping mutually unranked primitive constraints is too great, constraints so defined are fairly prevalent in the literature. For example, Steriade (1996) makes use of a constraint Paradigm Uniformity (PU) Stress which requires that all features within stressed syllables in-one member of a paradigm must be preserved in the corresponding syllable of other members of that paradigm. PU Stress is equivalent to a set of mutually unranked paradigm uniformity constraints for all phonological features. The empirical prediction of PU Stress is that changes in any one feature are as important as changes in any other. If PU Stress were instead to be considered a block of ranked constraints for the individual features, the prediction would be that in the comparison between one candidate in which the top-ranked feature is identical between stressed syllables of the paradigm members, but all other features are different, and another candidate in which only a lower-ranked feature is different, the first candidate would prevail. The data seems to bear out the prediction of the definition using mutually unranked constraints. Another possible definition of PU Stress would be to make use of constraint disjunction. In this definition, all features would be equally important, but the number of non-identical features would not matter--candidates differing in three features would be equal to candidates differing in one feature. Once again, the definition using mutually unranked constraints seems better borne out by the data.</Paragraph>
    <Paragraph position="5"> Leaving aside constraints such as PU Stress, we will see that complex constraints defined as combinations of mutually unranked constraints are useful as inputs to local constraint conjunctions. The formal definition of a complex constraint in terms of mutually unranked subconstraints is identical to the definition of a constraint disjunction, except that the weight n of a new edge is defined as the sum of the weights of the input edges nl and n2 rather than the disjunction: T = (El, E:, Q1 x Q2, F1 x F2, (sl, s2), E), ((q1,1, q2,1), a, n, (ql,2, q2.2)) E E iff (ql,1,al, nl, ql,2) E E1A (q2,1, a2, n2, q:,2) E E2A a 1 N a2 ---- aA A possible notation for a complex constraint C combining mutually unranked constraints C1 and C2 is C1 + C2, for example &amp;quot;(vce ~ vce) + (cont ~ cont)&amp;quot;.</Paragraph>
  </Section>
  <Section position="5" start_page="22" end_page="24" type="metho">
    <SectionTitle>
4 Local Constraint Conjunction
</SectionTitle>
    <Paragraph position="0"> Smolensky (1995) and Kirchner (1996) propose a different method for combining constraints: local conjunction. A local conjunction of constraints is defined as a constraint that outputs a violation for each domain of a specified type in which all of the component constraints are violated. A constraint may be locally conjoined with itself, in which case the resulting conjunction outputs a violation whenever there are two violations of the component constraint within the specified domain. The conjunction of a constraint C1 with itself within a domain 7 may be notated &amp;quot;A(C1)/7.&amp;quot; The following algorithm computes the local conjunction of constraint C1, where C1 is represented by the weighted finite state machine T1 = (El, 22, Q1, Sl, F1, El), with itself within  a domain 7 defined as the intersection of the domains '71 A... A 'Tn: 1. Weaken C1 to a new constraint C1 ~ such that for any utterance to which C1 assigns a non-zero number n of violations, C1' assigns n - 1 violations. This may be accomplished as follows:  (a) Copy T1 as T~, renumbering the states of T2 so that there is no ambiguity.</Paragraph>
    <Paragraph position="1"> (b) Combine T1 and T2 into T = (E1,E2, Q1uQ2,sl,FiUF2, E= E1U (c) For each edge (qi, a, w, qj) E El, where  w &gt; 0, modify the edge to (qi, a, w1, s~}, where s2 is the state corresponding to the start state of T~. T represents constraint C1'.</Paragraph>
    <Paragraph position="2">  where G\[ represents the beginning of domain '7, G\[ represents anything other than G\[, -4-6 represents the interior of the domain, \]G represents a boundary between two '7 domains, and \]G represents the end of the '7 domain.</Paragraph>
    <Paragraph position="3"> The machine M will be used to limit the evaluation of constraint C1 ~ to the domain '7. To accomplish this, we need to define the behavior at the edges of the '7 domain. Outside the '7 domain, violations of C1 ~ will have no effect. At the left edge of the 3' domain, violations that do not involve the left. edge of constituents will have no effect. At the right edge of the '7 domain, violations that do not involve the right edge of constituents will have no effect. The final weighted finite state machine L representing the local conjunction of C1 with itself is produced by intersecting M with T, with the following modifications made to the intersection algorithm. Edges from T that are intersected with the edge G\[, or edges from T that are intersected with the edge G\[ and contain no reference to a left edge, or edges from T that are intersected with the edge \]G and contain no reference to a right edge, are assigned a weight of 0, and if their destination within T was state s2, their destination in T is treated as having been Sl. This has the effect of limiting the constraint violations of C1 ~ to the domain 7. Edges from T that are intersected with edge IG keep their original weight, but are treated as though their destination within T was sl. This has the effect of resetting C1 ~ to zero violations at the beginning of a '7 domain immediately following another.</Paragraph>
    <Paragraph position="4"> The constraint A(C1)/7 produced by the above algorithm outputs a violation for every violation of C1 after the first within domain '7. Thus A(C1)/7 penalizes two or more violations of C1 within '7, but does not penalize single violations of C1.</Paragraph>
    <Paragraph position="5"> For example, the constraint A_kA is represented as the following weighted finite state machine: null \]lA/1 The result of the above algorithm is the following machine: IWd/0 While this algorithm does not allow definition of local conjunction of different constraints,  it can be given nearly equivalent power by applying it to the output of complex constraints formed from mutually unranked subconstraints.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML