<?xml version="1.0" standalone="yes"?>
<Paper uid="W99-0511">
  <Title>A Computational Lexicon of Portuguese for Automatic Text Parsing Elisabete RANCHHOD FLUL/CAUTL-IST</Title>
  <Section position="3" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction and Background
</SectionTitle>
    <Paragraph position="0"> The French DELA system was conceived and developed at LADL (Laboratoire d'Automatique Documentaire et Linguistique). It includes monolingual linguistic resources (mainly for French and English) specifically elaborated to be integrated into NLP systems. Standard methods and formats have been defined and are now used by other national teams working on their own languages: German, Greek, Italian, Portuguese and Spanish. Within that common framework, important fragments of the description of the languages involved have been worked out: the syntactic and semantic properties of free and frozen sentences are described and formalized. As for the lexicon, a major component of NLP, large-coverage electronic dictionaries have been built. Simple and compound words have been described, and their linguistic characteristics have been hand-coded by computational lexicographers using a common method. Most of these lexical resources can now be imported into the Intex NLP system, and then automatically applied to large texts. Within the scope of this article, we describe the set of lexical resources built so far for Portuguese, and we give different examples of automatic Portuguese text parsing.</Paragraph>
  </Section>
</Paper>