File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/89/e89-1040_intro.xml
Size: 4,103 bytes
Last Modified: 2025-10-06 14:04:44
<?xml version="1.0" standalone="yes"?> <Paper uid="E89-1040"> <Title>AN APPROACH TO SENTENCE-LEVEL ANAPHORA IN MACHINE TRANSLATION</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Theoretical research in the area of machine translation usually involves the search for and creation of an appropriate formalism. An important issue in this respect is the way in which the compositionality of translation is to be defined.</Paragraph> <Paragraph position="1"> In this paper, we will introduce the anaphoric component of the Mimo formalism. It makes the definition and translation of anaphoric relations possible, relations which are usually problematic for systems that adhere to strict compositionality. In iVlimo, the translation of anaphoric relations is compositional. The anaphoric component is used to define linguistic phenomena such as wh-movement, the passive and the binding of reflexives and pronouns monolingually. The actual working of the component will be shown in this paper by means of a detailed discussion of wh-movement.</Paragraph> <Paragraph position="2"> Introduction Theoretical research as part of machine translation often aims at finding an appropriate formalism. One of the main issues involved is whether the formalism does full justice to the idea that the translation of a whole is built from the translation of its parts on the one hand and whether it leaves enough room for the treatment of exceptions on the other hand. In other words, the question is in what way the idea of compositionality is to be defined within a particular formalism. An answer to this question from an interlingual perspective is given in the literature on the Rosetta system (e.g. Landsbergen 1985). The CAT framework (e.g. Arnold et a\]. 1986) was meant to be an answer to the same question, this time for a transfer system, viz. the Eurotra system. The MiMo formalism is a reaction to the CAT framework and tries to solve several translation problems by formulating an alternative definition of compositionality. Phenomena involving anaphora I such as wh-movement and the coindexation of pronominais often cause problems for strictly compositional systems since translation of one word depends on (the translation of) another word, one which can be quite far away in the sentence. Rosetta tackles this problem by distinguishing between rules that are significant with respect to the composRionality of translation, so-called meaningful rules, and rules that are not, referred to as transformations (Appelo et al. 1987); in this way the system is not compositional in the strict sense anymore. The notion of compositlonaUty MiMo adheres to is defined in such a way that anaphoric relations can be translated compositionally as well. In this paper we wiLi introduce the anaphoric component of the MiMo formalism. It is used to define Linguistic phenomena such as wh-movement, the binding of reflexives and pronouns, the passive and control phenomena monolinguaLiy.</Paragraph> <Paragraph position="3"> The formalism will be discussed by means of an extensive description of a possible analysis of wh-movement.</Paragraph> <Paragraph position="4"> In the next section, we will first discuss and motivate some of the more fundamental characteristics of the MiMo translation system. Section two will sketch the MiMo formalism 1In thls paper the term 'ansphoriC/' should be interpreted in the broaclest lense, as opposed to Chomsky 1981 in which only A-traces and reflexives are called anaphoric.</Paragraph> <Paragraph position="5"> - 299as far as necessary for understanding what will follow. The component that deais with the treatment of anaphora will be discussed in section 3. In the fourth section the actual working of the component will be shown by an elaborate discussion of wh-movement. Finaily, the translation of anaphoric relations will be defined and some idea will be given of the kind of problems that remain and that will have to be subject to further research.</Paragraph> </Section> class="xml-element"></Paper>