File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/90/c90-3069_intro.xml

Size: 1,785 bytes

Last Modified: 2025-10-06 14:04:56

<?xml version="1.0" standalone="yes"?>
<Paper uid="C90-3069">
  <Title>AN INTEGRATED SYSTEM FOR MORPHOLOGICAL ANALYSIS OF THE SLOVENE LANGUAGE</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1. Introduction
</SectionTitle>
    <Paragraph position="0"> We present an integrated environment for morphological analysis of (written) word-forms of the Slovene language. The language belongs to the Slavic family of languages, but exhibits some very idiosyncratic properties (e.g. having also a &amp;quot;dual&amp;quot; number and very rich inflection). Our project of writing a morphological analyzer and synthesizer (MAS) for the Slovene language has had primarily two aims. First, to write a useful MAS, which could serve as a front-end to other Slovene language processing systems, and second, to implement a model general enough to allow us to facilitate the study of Slovene morphology.</Paragraph>
    <Paragraph position="1"> The work on the project itself is split into two parts much along the same lines. First of the task of selecting and implementing a model versatile enough to cover the quirks of Slovene morphology, and second, the task of writing down the rules of Slovene morphology (Toporigie 84) in the formalism of the chosen model.</Paragraph>
    <Paragraph position="2"> The two-level model of Kimmo Koskeniemmi (Karttunen 88, Koskenniemi 84,85,86) was selected as the basic scheme for our MAS, our choice being influenced - among other things - by its prevalence in current (computer) morphological studies. This makes the system well documented and thus easy to implement, as well as simplifying the task of writing the rules for (phonologically) induced alternations of Slovenian word-forms (Erjavec 89).</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML