File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/99/p99-1051_concl.xml

Size: 1,144 bytes

Last Modified: 2025-10-06 13:58:29

<?xml version="1.0" standalone="yes"?>
<Paper uid="P99-1051">
  <Title>Acquiring Lexical Generalizations from Corpora: A Case Study for Diathesis Alternations</Title>
  <Section position="9" start_page="402" end_page="403" type="concl">
    <SectionTitle>
8 Conclusions
</SectionTitle>
    <Paragraph position="0"> This paper explored the degree to which diathesis alternations can be identified in corpus data via shallow syntactic processing. Alternating verbs were acquired from the BNC by using Gsearch as a chunk parser. Erroneous frames were discarded by applying linguistic heuristics, statistical scores (the log-likelihood ratio) and large-scale lexical resources  (e.g., WordNet).</Paragraph>
    <Paragraph position="1"> We have shown that corpus frequencies can be used to quantify linguistic intuitions and lexical generalizations such as Levin's (1993) semantic classification. Furthermore, corpus frequencies can make explicit predictions about word use. This was demonstrated by using the frequencies to estimate the productivity of an alternation for a given semantic class and the typicality of its members.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML