File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/97/w97-0216_abstr.xml
Size: 3,626 bytes
Last Modified: 2025-10-06 13:49:02
<?xml version="1.0" standalone="yes"?> <Paper uid="W97-0216"> <Title>Semantic Tagging and NLP Applications</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> DFKI GmbH </SectionTitle> <Paragraph position="0"> (German Research Center for Artificial Intelligence) {declerck; klein}@dfki, uni-sb, de There are hardly any an_notation schemes including semantic information, with the exception of Princeton WordNet (which will be extended by EuroWordNet for European languages). But some projects already addressed this topic, like FraCaS (Framework for Computational Semantics), or are starting to do this, like DIET (Diagnostic and Evaluation Tools for NL Applications, an extension of the TSNLP framework, see Lehmann et al., Coling 96) 1.</Paragraph> <Paragraph position="1"> What makes semantic tagging appealing is, among others, the (justified) hope that it will contribute to the improvement of the performances and the robustness of NLP systems. Besides this aspect, evaluation will also benefit from semantically tagged test corpora.</Paragraph> <Paragraph position="2"> In this working session, we focus on both the question how semantic tagging can support the development of NLP applications and, the other way round, how NLP systems can support semantic tagging. Among the different NLP projects making a (limited) use of semantic annotations, we are aiming at common annotation methodologies beyond particular approaches.</Paragraph> <Paragraph position="3"> As an example we describe a scenario which has been adopted within the context of a NLP project concerned with appointment sched~lling (COSMA, see Bnsem~_nn et al., ANLP 97), where information extraction techniques combined with a shallow-parsing strategy (see Neumann, ANLP 97) have been used in order to process just the relevant fragments of input texts.</Paragraph> <Paragraph position="4"> To support the development of the system and to delimit the linguistic coverage of the NLP application, a small corpus has been semantically hand-tagged, where the semantic annotations have been added to the mainly syntactic annotation scheme of the TSNLP framework. Thus the evaluation tool of TSNLP has been extended by a certain class of semantic information (non-ambiguous temporal expressions). Furthermore the FST automata developed for the purpose of messag e extraction have been designed along the lines of this annotation scheme. And the output of the FST automata has been defined in such a way that they can be used for an automatic (rule-based) semantic annotation of new text input (the annotation being limited to the temporal expression).</Paragraph> <Paragraph position="5"> Other NLP applications could reuse such a simple annotation in order to determlne, for example, selectional restrictions or text classifications. Addressing the reusability of annotation schemes for particular domains, one will have to consider if they can be just added to existing morpho-syntactic annotation schemes, as we described in the example above, or if the annotation work should be started from scratch, which could be necessary for more complex applications. A recently developed annotation scheme (see Skut et al., ANLP 97) is proposing an architecture with multiple levels of linguistic representation, for argument structure, grammatical function and syntactic category. We will investigate how semantic information can be integrated in such a framework and if the bidirectional interface between semantic tagging and NLP system, described above, can be adopted to this architecture.</Paragraph> </Section> class="xml-element"></Paper>