File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/98/p98-2181_intro.xml
Size: 2,529 bytes
Last Modified: 2025-10-06 14:06:40
<?xml version="1.0" standalone="yes"?> <Paper uid="P98-2181"> <Title>Building Accurate Semantic Taxonomies from Monolingual MRDs</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> There is no doubt about the increasing need of owning accurate and broad coverage general lexical/semantic resources for developing NL applications. These resources include Lexicons, Lexical Databases, Lexical Knowledge Bases (LKBs), Ontologies, etc. Many researchers believe that for effective NLP it is necessary to build a LKB which contain class/subclass relations and mechanisms for the inheritance of properties as well as other inferences. The work presented here attempts to lay out some solutions to overcome or alleviate the &quot;lexical bottleneck&quot; problem (Briscoe 91) providing a methodology to build large scale LKBs from conventional dictionaries, in any language. Starting with the seminal work of (Amsler 81) many systems have followed this approach (e.g., Bruce et al. 92; Richardson 97).</Paragraph> <Paragraph position="1"> Why should we propose another one? Regarding the resources used, we must point out that most of the systems built until now refer to English only and use rather rich, well structured, controlled and explicitly semantically coded dictionaries (e.g. LDOCE 87). This is not the case for most of the available sources for languages other than English. Our aim is to use conventional MRDs, with no explicit semantic coding, to obtain a comparable accuracy.</Paragraph> <Paragraph position="2"> The system we propose is capable of 1) performing fully automatic extraction (with a counterpart in terms of both recall and precision fall) of taxonomic links of dictionary senses and 2) ranking the extracted relations in a way that selective manual refinement is allowed.</Paragraph> <Paragraph position="3"> Section 2 shows that applying a conventional pure descriptive approach the resulting taxonomies are not useful for NLP. Our approach is presented in the rest of the paper. Section 3 deals with the automatic selection of the main semantic primitives present in Diccionario General Ilustrado de la Lengua Espafiola (DGILE 87), and for each of these, section 4 shows the method for the selection of its most representative genus terms. Section 5 is devoted to the automatic acquisition of large and accurate taxonomies from DGILE. Finally, some conclusions are drawn.</Paragraph> </Section> class="xml-element"></Paper>