File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/98/p98-2212_concl.xml
Size: 2,839 bytes
Last Modified: 2025-10-06 13:58:09
<?xml version="1.0" standalone="yes"?> <Paper uid="P98-2212"> <Title>Automatically Creating Bilingual Lexicons for Machine Translation from Bilingual Text</Title> <Section position="8" start_page="1302" end_page="1304" type="concl"> <SectionTitle> 6 Conclusion </SectionTitle> <Paragraph position="0"> The parse-parse-match approach to automatically building bilingual lexicons in not novel. Proposals have been put forward, e.g., by Sadler and Vendelmans (1990) and Kaji eta/. (1992).</Paragraph> <Paragraph position="1"> Wu (1995) points out some possible difficulties of the parse-parse-match approach. Among them, the facts that &quot;appropriate, robust, monolingual grammars may not be available&quot; and &quot;the grammars may be incompatible across languages&quot; (Wu, 1995, 355). More generally, in bilingual lexicon development there is a tendency to minimize the need for linguistic resources specifically developed for the purpose.</Paragraph> <Paragraph position="2"> In this view, several proposals tend to use statistical, knowledge-free methods, possibly in combination with the use of existing Machine Readable Dictionaries (see, e.g., Klavans and Tzoukermann (1995), which also contains a survey of related proposals, pages 195-196).</Paragraph> <Paragraph position="3"> The present proposal tackles the problem from a different and novel perspective. The acknowledgment that MT is the main application domain to which bilingual resources are relevant is taken as a starting point. The existence of an MT system, for which the bilingual lexicon is intended, is explicitly assumed. The potential problems due to the need for linguistic resources are by-passed by having the necessary resources available in the MT system. Rather than doing away with linguistic knowledge, the pre-existing resources of the pursued application are utilized.</Paragraph> <Paragraph position="4"> An approach like the present can be most effectively adopted to develop tools allowing MT systems to automatically build their own bilingual lexicons. A tool of this sort would use no extra resources in addition to those already available in the MT system itself. Such a tool would take a small sample of a bilingual lexicon and use it to bootstrap the automatic development of a large lexicon. It is worth noting that the bilingual pairs thus produced would be complete bilingual entries that could be directly incorporated in the MT system, with no post-editing or addition of information.</Paragraph> <Paragraph position="5"> The only requirement placed by the present approach on MT systems is that they be bidirectional. Therefore, although aimed at the development of specific applications for specific MT systems, the approach is general enough to apply to a wide range of MT systems.</Paragraph> </Section> class="xml-element"></Paper>