File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/94/h94-1003_intro.xml

Size: 3,262 bytes

Last Modified: 2025-10-06 14:05:41

<?xml version="1.0" standalone="yes"?>
<Paper uid="H94-1003">
  <Title>The Comlex Syntax Project: The First Year</Title>
  <Section position="3" start_page="0" end_page="0" type="intro">
    <SectionTitle>
2. Structure
</SectionTitle>
    <Paragraph position="0"> The structure of COMLEX has been discussed at length in our report to the 1993 HLT Workshop so we will briefly touch on the details of our dictionary entry. The major classes (adjectives, nouns and verbs) are marked for features and complements (subcategorization frames), examples of which can be seen in Figure 1.</Paragraph>
    <Paragraph position="1"> Nouns have 9 possible features and 9 possible complements; adjectives have 7 features and 14 complements; and verbs have 5 features and 92 complements. Figure 2 shows some actual dictionary entries, including some entries for adverbs and prepositions.</Paragraph>
    <Paragraph position="2"> In order to insure the completeness of our codes, we studied the coding employed by several other major lexicons, including tire Brandeis Verb Lexicon 2, the ACQUILEX Project \[10\], the NYU Linguistic String Project \[9\], the OALD, and the LDOCE, and, whenever feasible, have sought to incorporate distinctions made in any of these  dictionaries. The names for the different complement types are based on the conventions used in the the Brandeis Verb Lexicon.The notation indicates the type and order of the elements (NP = noun phrase, PP = prepositional phrase, NP-PP = a noun phrase followed by a prepositional phrase, :pval = the selected prepositions).</Paragraph>
    <Paragraph position="3"> The subcategofization types are defined by frames. These frames which appear in our reference manual (see Figure 3) include the constituent structure :es, the grammatical structure :gs, optional :foaturo$ and one or more examples :ox. The features in the sub-categorization frames are not those in the dictionary but refer to the control or raising properties of the verb where applicable. In particular, they capture four different types of control: subject control, object control, variable control, and arbitrary control. Furthermore, the notation allows us to indicate that a verb may have different control features for different complement structures, or even for different prepositions within the complement. We record, for example, that &amp;quot;blame ... on&amp;quot; involves arbitrary control (&amp;quot;He blamed the problem on going too fast.&amp;quot;), whereas &amp;quot;blame for&amp;quot; involves object control (&amp;quot;He blamed John for going too fast.&amp;quot;).</Paragraph>
    <Paragraph position="4"> There are two complements represented by the frames in Figure 3, possing and ing-sz, possing stands for a frame group which includes two frames *possing (where the subject of the gerund is present) and *ing-ae (where the subjectis interpreted to be arbitrary).</Paragraph>
    <Paragraph position="5"> A verb which is assigned possing must be able to occur in both of these frames, ing-sc also stands for a frame group. It includes bo-ing-sC/ and *possing. Here the subject of the gerund must be the same as the surface subject and the possessive subject of *possing will be co-referential with the surface subject.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML