File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/97/w97-0205_abstr.xml
Size: 4,504 bytes
Last Modified: 2025-10-06 13:49:02
<?xml version="1.0" standalone="yes"?> <Paper uid="W97-0205"> <Title>Adaptable Semantic Lexicon with Systematic Pol-</Title> <Section position="1" start_page="0" end_page="25" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> The paper defends the notion that semantic tagging should be viewed as more than disambiguation between senses. Instead, semantic tagging should be a first step in the interpretation process by assigning each lexJ.cal item a representation of all of its sy=stematically related senses, from which fuxther semantic processing steps can derive discourse dependent interpretations. This leads to a new type of semantic lexicon (CoRv.Lzx) that supports underspecified semantic tagging through a design based on systematic polysemous classes and a class-based acquisition of lexical knowledge for specific domains.</Paragraph> <Paragraph position="1"> 1 Underspecified semantic tagging Semantic tagging has mostly been considered as nothing more than disambiguation to be performed along the same lines as part-of-speech tagging: given n lexical items each with m senses apply linguistic heuristics and/or statistical measures to pick the most likely sense for each lexical item (see eg: (Yarowsky, 1Q92) (Stevenson and Wilks, 1997)).</Paragraph> <Paragraph position="2"> I do not believe this to be the right approach because it blurs the distinction between 'related' (systematic polysemy) and 'unrelated' senses (homonymy : bank - bank). Although homonyms need to be tagged with a disambiguated sense, this is not necessarily so in the case of systematic polysemy. There are two reasons for this that I will discuss briefly here. First, the problem of multiple reference. Consider this example from the BROWN corpus: \[A long book heavily weighted with milltary technlcalities\]Np, in this edition it is neither so long nor so technical as it was originally.</Paragraph> <Paragraph position="3"> The discourse marker (it) refers back to an NP that expresses more than one interpretation at the same time. The head of the NP (book) has a number of systematically related senses that are being expressed simultaneously. The meaning of book in this sentence cannot be disambiguated between the number of interpretations that are implied: the informational content of the book (military technicalities), its physical appearance (heavily weighted) and the events that are involved in its construction and use (long).</Paragraph> <Paragraph position="4"> The example illustrates the fact that disambiguation between related senses is not always possible, which leads to the further question if a discrete distinction between such senses is desirable at all. A number of researchers have answered this question negatively (see eg: (Pustejovsky, 1995) (Killgariff, 1992)). Consider these examples from BROWN: (1) fast run-up (of the stock) (2) fast action (by the city government) (3) fast footwork (by Washington) (4) fast weight gaining (5) fast condition (of the track) (6) fast response time (7) fast people (8) fast ball Each use of the adjective 'fast' in these examples has a slightly different interpretation that could be captured in a number of senses, reflecting the different syntactic and semantic patterns. For instance: 1. 'a fast action' (1, 2, 3, 4) 2. 'a fast state of affairs' (5, 6) 3. 'a fast object' (7, 8) On the other hand all of the interpretations have something in common also, namely the idea of 'speed'. It seems therefore useful to underspecify the lexical meaning of 'fast' to a representation that captures this primary semantic aspect and gives a general structure for its combination with other lexical items, both locally (in compositional semantics) and globally (in discourse structure).</Paragraph> <Paragraph position="5"> Both the multiple reference and the sense enumeration problem show that lexical items mostly have an indefinite number of related but highly discourse dependent interpretations, between which cannot be distinguished by semantic tagging alone. Instead, semantic tagging should be a first step in the interpretation process by assigning each lexical item a representation of all of its systematically related 'senses'. Further semantic processing steps derive discourse dependent interpretations from this representation. Semantic tags are therefore more like pointers to complex knowledge representations, which can be seen as underspecified lexical meanings.</Paragraph> </Section> class="xml-element"></Paper>