File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/82/c82-2054_abstr.xml
Size: 5,582 bytes
Last Modified: 2025-10-06 13:46:02
<?xml version="1.0" standalone="yes"?> <Paper uid="C82-2054"> <Title>ON AN APPROACH FOR DESIGNING LINGUISTIC PROCESSORS</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> ON AN APPROACH FOR DESIGNING LINGUISTIC PROCESSORS </SectionTitle> <Paragraph position="0"> Mathematics with Computer Center, Bulgarian Academy of Sciences, P.OvB. 373, I090 Sofia, Bulgaria The present paper discusses the principles of designing a system for man-computer di~lo6ue in natural, language. The system is being elaborated at the Laboratory of Mathematical Linguistics at the Institute of Mathematics with Computer Center of the Bulgarian Academy of Sciences.</Paragraph> <Paragraph position="1"> The desorlbed system requires: I. A form~ description of the syntax of basic nuclear structures of the natural lanEuage sentences which has to be used during the process of syntactical analysis of the users&quot; phrases in a natural language~ 2. Techniques for representing in the computer the knowledge about a given problem area. The obtained description has to be used during the process of semantical analysis of the users&quot; phrases in a natural language! 3. A dictionary containing the terms used in the chosan problem area. The dictionary is compiled by specialists in the given problem area; 4e A dictionary containing the eel-vice vocabulary (independent of a particular problem area) - word-groups as &quot;less than&quot;, &quot;geater than&quot;, &quot;equal to&quot;, &quot;slid&quot;, &quot;or&quot;, &quot;as... ash etc.</Paragraph> <Paragraph position="2"> 5. A basic software system - a data base management - 222 - null system (for example, a relational DBMS), or a program package (for example, BMDP - bio-medical data processing). These ready software products service users of- a given problem area and they usually have their own language for describing and processing the data.</Paragraph> <Paragraph position="3"> The considered system is viewed as a superstructure over the given software system and it performs: 1. Translation from a language close to the natur~al one into the internal software system's language for description or processing the data; 2. An effort to analyse the correctness of the user's request in terms of the given problem area description and messages when discovering mistakes! 3. When ambiguities happen a dialogue is carried out in order to make the user's request unambiguous.</Paragraph> <Paragraph position="4"> 4. ~rocessing the results from the basic software sys- null tem and final shaping of the messages in the context of the user's request.</Paragraph> <Paragraph position="5"> Such a system is called a linguistic processor (or linguistic preprocessor) and is considered as a dynamic system which can be generated over various problem areas and various basic software systems.</Paragraph> <Paragraph position="6"> By &quot;a natural fan.age as a tool for the man - computer dialogue&quot; in this paper we mean the following: - a fixed list of appropriate natural language phrases is not aseigned~ - restrictions are not imposed on the ~rammatical structures of the sentences the man enters into the computer! - man has to interact with the computer in a correct natural language.</Paragraph> <Paragraph position="7"> - 223 - null Dictionary of the terms Structural description of the problem area of the problem area user analysis of terms I</Paragraph> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> Results </SectionTitle> <Paragraph position="0"> Output ,to the user L Fig.1. Functional relationships between different parts of the linguistic processor and the problem area descriptions. - 224 The describe approach for creatin~ linguistic processors can be qualified as a semantic - syntactical one since semantical analysis in terms of the given problem area description has a leading role in the process of &quot;understanding&quot; of the input phrase in a natural language, while syntactical analysis has an auxiliary role. The structural description of the prob~ lem area is a network designed for the representation of the problem area semantics and is viewed as a static, inmoblle and determinative tool for &quot;understanding&quot; and accomplishing the dialogue. The syntactical analysis does not presume a full scanning of the input sentence. It involves analysis of syntactical dependences at different levels with respect to the problem area complexity and, naturally, to the input phrase complexity. By means of an elaboration of the formal description of Bulgarian syntax, an improvement of the techniques for syntactical analysis is envisaged.</Paragraph> <Paragraph position="1"> When workir~ out the principles of construction of linguistic processors, some peculiarities of the Bulgarian lan~age have been taken into account, namely, its flexiona~ character, the absence of a case system, free word-order, etc.</Paragraph> <Paragraph position="2"> The approach proposed for the realization of the man computer dialogue in a natural language allows to process problem areas with different complexity. Linguistics processors at two levels are bein~ elaborated at the Laboratory of Mathematical Linguistics - for access of users - nonprogr~mnere to a relational data base in Bulgarian l~ags and for access of users - nonprogrammere to the facilities for stat-Istloal analysis proposed by the program package \]~DI )</Paragraph> </Section> </Section> class="xml-element"></Paper>