File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/87/j87-3003_intro.xml
Size: 3,418 bytes
Last Modified: 2025-10-06 14:04:38
<?xml version="1.0" standalone="yes"?> <Paper uid="J87-3003"> <Title>TOOLS AND METHODS FOR COMPUTATIONAL LEXICOLOGY</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> 1. INTRODUCTION. </SectionTitle> <Paragraph position="0"> It has become clear that the construction of computer systems that process natural language requires the creation of large computerized lexicons with extensive and accurate syntactic and semantic information about words (see Byrd(1986a), Ingria(1986)). It is also clear that it will be impossible to build these lexicons in the number and sizes required with only the manual labor of individual computer scientists, linguists, or lexicographers. There are too many systems requiring too much information about too many words for the manual approach to succeed.</Paragraph> <Paragraph position="1"> Fortunately, researchers are beginning to realize that the wealth of information in published dictionaries can be tapped by semi-automatic and fully-automatic meth- null ods, in order to help build the computerized lexicons we require. Work reported in Alshawi(1985), Calzolari (1983,1984a,b), Lesk(1986), Michiels(1982), etc., describe various attempts to decipher and extract information in machine-readable versions of published dictionaries (henceforth: MRDs).</Paragraph> <Paragraph position="2"> The present paper is intended to contribute to that literature by describing the tools and methods used by the Lexical Systems project at IBM Research. Individual tools and their use in that project have been described elsewhere (Chodorow, et a1.(1985), Byrd and Chodorow(1985), Byrd, et al.(1986b), Chodorow and Ravin(1987), Neff and Byrd(1987)). This paper differs from previous descriptions of our own and others' work, however, by presenting a framework and a set of general tools and methods that can be re-applied for a variety of lexicological and lexicographical applications. Furthermore, it addresses problems involved in coherently combining information from different MRDs. In particular, we show how we can transfer information from one dictionary to another -- even Copyright 1987 by the Association for Computational Linguistics. Permission to copy without fee all or part of this material is granted provided that the copies are not made for direct commercial advantage and the CL reference and this copyright notice are included on the first page. To copy otherwise, or to republish, requires a fee and/or specific permission. 0362-613 X/87/030219-240503.00 Computational Linguistics, Volume 13, Numbers 3-4, July-December 1987 219 Roy J. Byrd, Nicoletta Calzolari, Martin S. Chodorow, Judith L. Klavans, Mary S. Neff, Omneya A. Rizk Tools and Metltods across languages -- and correctly associate that information with the word senses to which it applies.</Paragraph> <Paragraph position="3"> The plan of this paper roughly parallels the logical sequence of problems that face any project in computational lexicology. Section 2 on acquisition and storage describes the initial stages of capturing and organizing dictionary data. Section 3 outlines the tools and methods we use to exploit the contents of machine readable dictionaries. Applications of the tools and methods are described in the final two sections which discuss applied computational lexicography for monolingual and bilingual dictionaries.</Paragraph> </Section> class="xml-element"></Paper>