File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/02/c02-1027_concl.xml

Size: 1,034 bytes

Last Modified: 2025-10-06 13:53:14

<?xml version="1.0" standalone="yes"?>
<Paper uid="C02-1027">
  <Title>Shallow language processing architecture for Bulgarian</Title>
  <Section position="6" start_page="0" end_page="0" type="concl">
    <SectionTitle>
5 Conclusion
</SectionTitle>
    <Paragraph position="0"> This paper outlines the development of the first robust and shallow text processing framework in Bulgarian LINGUA which includes modules for tokenisation, sentence splitting, paragraph segmentation, part-of-speech tagging, clause chunking, noun phrases extraction and anaphora resolution (Figure 1). Apart from the module on pronoun resolution which was adapted from Mitkov's knowledge-poor approach for English and the incorporation of BULMORPH in the part-of-speech tagger, all modules were specially built for LINGUA. The evaluation shows promising results for each of the modules.</Paragraph>
    <Paragraph position="1"> 7The optimisation made use of genetic algorithms in a manner similar to that described in (Orasan et al., 2000).</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML