File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/69/c69-0401_concl.xml
Size: 1,485 bytes
Last Modified: 2025-10-06 13:55:54
<?xml version="1.0" standalone="yes"?> <Paper uid="C69-0401"> <Title>Automatic Processing of Foreign Language Documents</Title> <Section position="7" start_page="0" end_page="0" type="concl"> <SectionTitle> 7. Conclusion </SectionTitle> <Paragraph position="0"> An experiment using a multi-lingual thesaurus in conjunction with two d~.fferent document collections, in German and English respectively, has shown that cross-language processing (for example, German queries against English documents) is nearly as effective as processing within a single language. Furthermore, a simple translation of thesaurus categories appears to produce a document content analysis which is equally effective in English as in German. In particular, differences in morphology (for example, in the suffix cut-off rules\], and in language ambiguities do not seem to cause a substantial degradation when moving from one language to another.</Paragraph> <Paragraph position="1"> For these reasons, the automatic retrieval methods used in the SMART system for English appear to be applicable also to foreign language material.</Paragraph> <Paragraph position="2"> Future experiments with foreign language documents should be carried out using a thesaurus that is reasonably complete in all languages, and with identical query and document collections for which the same relevance judgments may then be applicable across all runs.</Paragraph> <Paragraph position="3"> -28-</Paragraph> </Section> class="xml-element"></Paper>