File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/04/w04-2206_intro.xml
Size: 1,895 bytes
Last Modified: 2025-10-06 14:02:47
<?xml version="1.0" standalone="yes"?> <Paper uid="W04-2206"> <Title>A Method of Creating New Bilingual Valency Entries using Alternations</Title> <Section position="3" start_page="0" end_page="0" type="intro"> <SectionTitle> 2 Resources </SectionTitle> <Paragraph position="0"> We use two main resources in this paper: (1) a seed lexicon of high quality hand-made valency entries; and (2) lists of verbs that undergo one or more S=O alternations.</Paragraph> <Paragraph position="1"> The alternation list includes 449 native Japanese verbs that take the S=O alternation, based on data from Jacobsen (1981), Bullock (1999) and the Japanese/English dictionary EDICT (Breen, 1995). Each entry consists of a pair of Japanese verbs with one or more English glosses. Expanding out the English results in 839 Japanese-English pairs in all. Some examples are given in Table 1.</Paragraph> <Paragraph position="2"> As a seed lexicon, we use the valency dictionary (Ikehara et al., 1997) from the Japanese-to-English machine translation system ALT-J/E. It consists of linked pairs of Japanese and English verbs. There are 5,062 Japanese verbs and 11,214 entries (ignoring all idiomatic and adjectival entries). Verb entries in both languages have information about the argument structure (subcat) of the verb. In addition to the core arguments, adjunct cases are added to many patterns to help in disambiguation.1 The Japanese side has selec1This is common in large NLP lexicons, such as COMtional restrictions (SR) on the arguments. The arguments are linked between the two languages using case-roles (N1, N2, ...).</Paragraph> <Paragraph position="3"> The seed lexicon covered 381 out of the 449 linked Japanese pairs (85%). In the next section, in order to examine the nature of the alternation we compare the case roles and translation of the linked valency pairs.</Paragraph> </Section> class="xml-element"></Paper>