File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/94/c94-1015_intro.xml

Size: 4,736 bytes

Last Modified: 2025-10-06 14:05:35

<?xml version="1.0" standalone="yes"?>
<Paper uid="C94-1015">
  <Title>Constituent lloundary Parsing for Exanll)lo-lkised Maclhine Tr,'inslation</Title>
  <Section position="3" start_page="0" end_page="705" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> I-xample-basod franieworks are increasingly being applied to machiilo translatioi/, since th0y c~.ill l)rovido efficient and robust processing (Nagao, 1984; Sate, 1991; Sumita, 1992; Furuse, 1992; Watanabe, 1992).</Paragraph>
    <Paragraph position="1"> However, in order to make tilt best use o1 the a(.lv:.lnlages of an example-based fl'amcwork, it is essential to effectively integrate an example-based method anti source language analysis. Unfortunately, whcll all exainl)le-based nletiiod ix combined with a SOUFC0 lnnguago analysis inelhod having cOlnl)lox l~r~illilliflr rules, pulling a heavy load eli translalion, the advai/lai;os of lhe example-based franiowork iilay l)e ruined. To achieve efficient and robnst processing by the exanii)lc-basod framework, a lot of sttldies have beell nlado for the pui\])ose of combining source lal!gtiage analysis with all example-based method, lind of efficiently covering the analyzed source langilllge strtiCttlro by me;illS of trailsfcr knowledge (Grishman, 1992; Jollcs, 1992; McLean, 1992; Manlyama, 1992, 1993; Nirenburg 1993).</Paragraph>
    <Paragraph position="2"> One wily to reduce tilt load of source langua!,,c analysis ix to directly apply trallSl'cr knowledge to all input siring, which sinlultaneously executes both siruciinal parsing aiM transfer knowlc.dgo al)lHication through pattorll-il/atchii/g, l:'allerll-nlalchi~ig does liot rise grainillaticaI symbols such as &amp;quot;Notlil Pliraso&amp;quot;, but uses surfi.ice words an(\] non-granlmalical synlbols. Therefore, in patlern-matching, rule coml)otition is reduced, and linguistic structure is expressed in a simpler manner thall ill gramnmr-based parsing. Thus, pattern-nlatcifing achieves efficient 1)arsing. It is also useful in treating spoken language, which sometimes deviates from convcntion:ll grammar, while grammar-based p,'lrsing has difficulty treating ilnreslricle(l spoken I\[ingllll,ge.</Paragraph>
    <Paragraph position="3"> This pal)Or proposes a constituom boundary parsing method based on paltorn-niatching, and shows its effeclivonoss for spoken langnago translation within the exaniple-I)asod framework. In otlr parsing method, aii inl)Ut string .is applied linguistic patterns expressing some linguistic constitticnts and their bonnds-lrios, in a top-down f:.tshion. \Vhon structural anlbiguity occurs, tile most phlusi/)lo structure is selected rising the total vahios of dislanco calculations in tilt example-based lrs-Illiowork. Shico the description of a linguistic ps-ittern is sinlplo, it is easy to update by adding f0etlback.</Paragraph>
    <Paragraph position="4"> A constiLuonl boundary ixusing method using nuitual illfoiillation i~ l)roposed in (M,'lgerlflan 1990). This method accouilts for the unrestricted lls-ltLlra\] langtlage and is efficient, llowever, it tends to be illacctirate&gt; and difficult, to ad(l feedback to, since it completely depends on st'ltistical information withoul, resort to a linguistic viewpoint. On the cont,ary&gt; in order to achieve accurate parsing and Iransb'ition, our conslituent boundary parsing method implicitly incorporates grammatical information into p'ltterns, e.g. constituent boundary description by a i)art-of-sl)eech bigrani, and classification of i)ailerns according lo linb, uislic levels such s.ls simple sentence ,tlrld tlOtHI l)hrase.</Paragraph>
    <Paragraph position="5"> Tlallsfer-Orivell Maehillo TranslatiOll (TI)MT) (\[:tlrtiso&gt; 1992, 1994) uses tile COl/Stil.llont botlndary 1)a~sint ,, liielhod l)l'eSollto(l in this paper, as an alternative to glamliiar-based ali:.ilysis, aiKI lliakos the i)ost ilSe of the ex:lmplo-based framework. A bidirectional translation syslcnl between Jap,'lnesc lind English for dialogue sentences concerning international conference regislralions has been illlplenlented (Sobashima, 1994).</Paragraph>
    <Paragraph position="6"> l~xperimonts with the systonl have shown ollr parsing iiicthod I() t~ effcctive.</Paragraph>
    <Paragraph position="7"> Section 2 defines patterns expressed by variables and con.&lt;;liluont boundaries. Section 3 OXl)lains a method for derivin{, possible English structures. Soelion 4 explain'4 structural disanibi,gnaliOti using tlislanco calculations in Iho oxanilflo-b,'lsed framework. Section 5 exphlins an example of Japanese sent0nee analysis using our consliluont boundary parsing method&gt; and Section 6  reports on the experimental resulLs.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML