<?xml version="1.0" standalone="yes"?>
<Paper uid="P98-1097">
  <Title>Improving Automatic Indexing through Concept Combination and Term Enrichment</Title>
  <Section position="3" start_page="0" end_page="595" type="intro">
    <SectionTitle>
2 Basic Controlled Indexing
</SectionTitle>
    <Paragraph position="0"> Preprocessing the corpus with the tagger yields a morphologically analyzed text with unambiguous syntactic categories. The tagged corpus is then automatically indexed by FASTR, which retrieves occurrences of multi-word terms or their variants (see Table 1).</Paragraph>
    <Paragraph position="1"> La variation mensuelle de la respiration du sol et ses rapports avec l'humidité et la température du sol ont été analysées dans le sol superficiel d'une forêt tropicale. (The monthly variation of the respiration of the soil and its connections with the moisture and the temperature of the soil have been analyzed in the surface soil of a tropical forest.)
    i1 | 007019 | Respiration du sol | Occurrence | respiration du sol (respiration of the soil)
    i2 | 002904 | Sol de forêt | Embedding2 | sol superficiel d'une forêt (surf. soil of a forest)
    i3 | 012670 | Humidité du sol | Coordination1 | humidité et la température du sol (moisture and the temperature of the soil)
    i4 | 007034 | Température du sol | Occurrence | température du sol (temperature of the soil)
    i5 | 007035 | Analyse de sol | VerbTransf1 | analysées dans le sol (analyzed in the soil)
    i6 | 007809 | Forêt tropicale | Occurrence | forêt tropicale (tropical forest)
    Each variant is obtained by generating term variations through local transformations composed of an input lexico-syntactic structure and a corresponding output transformed structure. Thus, VerbTransf1 is a verbalization which transforms a Noun-Preposition-Noun term into a verb phrase represented by the variation pattern V4 (Adv? (Prep? Art | Prep) A?) N3 (footnote 1).</Paragraph>
    <Paragraph position="3"> The constraint following the output structure states that V4 belongs to the same morphological family as N1, the head noun of the term. VerbTransf1 recognizes analysées[V] dans[Prep] le[Art] sol[N] (analyzed in the soil) as a variant of analyse[N] de[Prep] sol[N] (soil analysis).</Paragraph>
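    <Paragraph>The matching step described above can be illustrated with a minimal Python sketch. It is not FASTR's implementation: the Token type, the toy MORPH_FAMILY lexicon, and the function names are hypothetical, and the check that the final noun shares its lemma with the term's N3 is an assumption drawn from the shared index in the pattern. The sketch walks a tagged token sequence against V4 (Adv? (Prep? Art | Prep) A?) N3 and then applies the morphological-family constraint between V4 and N1.

```python
from typing import List, NamedTuple, Optional


class Token(NamedTuple):
    word: str
    tag: str    # V, N, A, Adv, Art, Prep (category abbreviations from the paper)
    lemma: str


# Hypothetical toy lexicon: lemma -> morphological family label.
MORPH_FAMILY = {"analyse": "analys", "analyser": "analys"}


def tag_at(tokens: List[Token], i: int) -> str:
    """Return the tag at position i, or an empty string past the end."""
    return tokens[i].tag if len(tokens) > i else ""


def match_verb_transf1(tokens: List[Token], start: int,
                       n1_lemma: str, n3_lemma: str) -> Optional[int]:
    """Match V4 (Adv? (Prep? Art | Prep) A?) N3 starting at `start`.

    Returns the index just past the match, or None if there is no match.
    """
    i = start
    if tag_at(tokens, i) != "V":
        return None
    verb = tokens[i]
    i += 1
    if tag_at(tokens, i) == "Adv":       # Adv?
        i += 1
    if tag_at(tokens, i) == "Prep":      # Prep alone, or Prep followed by Art
        i += 1
        if tag_at(tokens, i) == "Art":
            i += 1
    elif tag_at(tokens, i) == "Art":     # Art without a preceding Prep
        i += 1
    else:
        return None
    if tag_at(tokens, i) == "A":         # A?
        i += 1
    # Assumption: N3 must be the same noun as the third word of the term.
    if tag_at(tokens, i) != "N" or tokens[i].lemma != n3_lemma:
        return None
    # Constraint: V4 and N1 belong to the same morphological family.
    family = MORPH_FAMILY.get(verb.lemma)
    if family is None or family != MORPH_FAMILY.get(n1_lemma):
        return None
    return i + 1


# The variant from the paper: analysées[V] dans[Prep] le[Art] sol[N],
# checked against the term analyse[N] de[Prep] sol[N].
variant = [
    Token("analysées", "V", "analyser"),
    Token("dans", "Prep", "dans"),
    Token("le", "Art", "le"),
    Token("sol", "N", "sol"),
]
print(match_verb_transf1(variant, 0, "analyse", "sol"))  # prints 4
```

    Because each optional element is followed by a category it cannot be confused with, a single left-to-right pass suffices here; a general transformation engine would need backtracking or a compiled automaton.</Paragraph>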
    <Paragraph position="4"> Six families of term variations are accounted for by our implementation for French: coordination, compounding/decompounding, term embedding, verbalization (of nouns or adjectives), nominalization (of nouns, adjectives, or verbs), and adjectivization (of nouns, adjectives, or verbs). Each index in Table 1 corresponds to a unique term; it is referenced by its identifier, its string, and a unique variation of one of the aforementioned types (or a plain occurrence).
    Footnote 1: The following abbreviations are used for the categories: V = verb, N = noun, Art = article, Adv = adverb, Conj = conjunction, Prep = preposition, Punc = punctuation.</Paragraph>
  </Section>
</Paper>