File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/03/p03-1068_intro.xml
Size: 2,852 bytes
Last Modified: 2025-10-06 14:01:49
<?xml version="1.0" standalone="yes"?> <Paper uid="P03-1068"> <Title>Towards a Resource for Lexical Semantics: A Large German Corpus with Extensive Semantic Annotation</Title> <Section position="3" start_page="0" end_page="0" type="intro"> <SectionTitle> 2 Resources </SectionTitle> <Paragraph position="0"> SALSA currently extends the TIGER corpus by semantic role annotation, using FrameNet as a resource. In the following, we will give a short overview of both resources.</Paragraph> <Paragraph position="1"> FrameNet. The FrameNet project (Johnson et al., 2002) is based on Fillmore's Frame Semantics. A frame is a conceptual structure that describes a situation. It is introduced by a target or frame-evoking element (FEE). The roles, called frame elements (FEs), are local to particular frames and are the participants and props of the described situations.</Paragraph> <Paragraph position="2"> The aim of FrameNet is to provide a comprehensive frame-semantic description of the core lexicon of English. A database of frames contains the frames' basic conceptual structure, and names and descriptions for the available frame elements. A lexicon database associates lemmas with the frames they evoke, lists possible syntactic realizations of FEs and provides annotated examples from the BNC. The current on-line version of the frame database (Johnson et al., 2002) consists of almost 400 frames, and covers about 6,900 lexical entries.</Paragraph> <Paragraph position="3"> involves a FE SPEAKER who voices the request, an ADDRESSEE who is asked to do something, the MESSAGE, the request that is made, the TOPIC that the request is about, and the MEDIUM that is used to convey the request. Among the FEEs for this frame are the verb ask and the noun request. In the frame COMMERCIAL TRANSACTION (henceforth C T), a BUYER gives MONEY to a SELLER and receives GOODS in exchange. This frame is evoked e.g. by the verb pay and the noun money.</Paragraph> <Paragraph position="4"> The TIGER Corpus. We are using the TIGER Corpus (Brants et al., 2002), a manually syntactically annotated German corpus, as a basis for our annotation. It is the largest available such corpus (80,000 sentences in its final release compared to 20,000 sentences in its predecessor NEGRA) and uses a rich annotation format. The annotation scheme is surface oriented and comparably theoryneutral. Individual words are labelled with POS information. The syntactic structures of sentences are described by relatively flat trees providing information about grammatical functions (on edge labels), syntactic categories (on node labels), and argument structure of syntactic heads (through the use of dependency-oriented constituent structures, which are close to the syntactic surface). An example for a syntactic structure is given in Figure 2.</Paragraph> </Section> class="xml-element"></Paper>