File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/00/c00-1019_abstr.xml

Size: 1,759 bytes

Last Modified: 2025-10-06 13:41:33

<?xml version="1.0" standalone="yes"?>
<Paper uid="C00-1019">
  <Title>Figure 2: Sample Clusters (French-English word pairs)</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> Previous work has shown that adding generalization of the examples in the corpus of an example-based machine translation (EBMT) system can reduce the required amount of pretranslated example text by as much as an order of magnitude for Spanish-English and French-English EBMT. Using word clustering to automatically generalize the example corpus can provide the majority of this improvement for French-English with no manual intervention; the prior work required a large bilingual dictionary tagged with parts of speech and the manual creation of grammar rules. By seeding the clustering with a small amount of manually created information, even better performance can be achieved. This paper describes a method whereby bilingual word clustering can be performed using standard monolingual document clustering techniques, and its effectiveness at reducing the size of the example corpus required.</Paragraph>
  </Section>
</Paper>