<?xml version="1.0" standalone="yes"?> <Paper uid="A94-1032"> <Title>Automatic Acquisition of Semantic Attributes for User Defined Words in Japanese to English Machine Translation Satoru Ikehara *, Satoshi Shirai *, Akio Yokoo *</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> 1. Introduction </SectionTitle> <Paragraph position="0"> When working with a machine translation (MT) system, users compile user dictionaries for words that are not registered in the system dictionaries or that have inappropriate translations [1]. When registering a new word in a dictionary, however, it is not enough to give the index word and its translation; syntactic, semantic and various other information must also be supplied. Systems aiming at higher levels of translation quality require more detailed and accurate information [2,3], and supplying such information is no simple task for laymen. Semantic information in particular usually requires the skill of professionals.</Paragraph> <Paragraph position="1"> This paper focuses on the characteristics of user defined words. A method is proposed in which, for an index word (a noun or compound noun) in the source language that a user seeks to register, the user need give only its translation in the target language. The system then applies the knowledge held in the system dictionaries to automatically judge the type of noun and determine the semantic attributes (SAs) of the word for that noun type.</Paragraph> <Paragraph position="2"> 2. 
System Dictionaries and User Dictionaries</Paragraph> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 2.1 Semantic Dictionaries in ALT-J/E </SectionTitle> <Paragraph position="0"> Here, we refer to the dictionary supplied in advance with the MT system as a system dictionary, and to the dictionary prepared and used by users as a user dictionary.</Paragraph> <Paragraph position="1"> For the Japanese to English MT system ALT-J/E [4], the relationship between the system dictionaries, the user dictionaries and word SAs is shown in Figure 1. In the semantic dictionaries, semantic information is written using SAs.</Paragraph> <Paragraph position="2"> Common nouns in the semantic word dictionary are given common noun SAs (generally more than one). Proper nouns are given both common noun SAs and proper noun SAs (both more than one). Verbal SAs are given to sentence patterns registered in the semantic structure dictionary [5].</Paragraph> </Section> </Section> </Paper>