File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/98/w98-1208_intro.xml
Size: 1,826 bytes
Last Modified: 2025-10-06 14:06:47
<?xml version="1.0" standalone="yes"?> <Paper uid="W98-1208"> <Title>I! | Implementing a Sense Tagger in a General Architecture for Text Engineering</Title> <Section position="3" start_page="0" end_page="0" type="intro"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> This paper is about two things: a novel hybrid sense tagger for unrestricted text (Wilks and Stevenson, 1997), and the experience of developing this system within GATE - a General Architecture for Text Engineering (Cunninham et al., 1997; Cunningham, Wilks, and Gaizauskas, 1996a).</Paragraph> <Paragraph position="1"> We hope you can forgive this mild schizophrenia - we feel that both topics are relevant to the sub-ject of new methods in NLP: the first because both the problem of arriving at methods for sense tagging and of tuning those methods to specific domains and lexical resources is an increasingly active topic in the field (Basili, Della Rocca, and Pazienza, 1997; Harley and Gleanon, 1997); the second because work on NLP components shares a heap of problems with other language processing work to do with reusability, data visualisation, and software-level robustness and efficiency that, we feel, are best solved by the provision of a inclusive and general architecture and development environment for the field.</Paragraph> <Paragraph position="2"> We begin with a review of the general concept behind GATE (section 2), then describe the practicalities of the system that are relevant to the sense tagging system we have developed (section 3). Next we discuss the sense tagging problem (section 4), and then our system (section 5). Finally we look at the experience of developing the tagger within the architecture (section 6), and draw out some lessons for the future (section 7).</Paragraph> </Section> class="xml-element"></Paper>