File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/96/c96-2213_concl.xml
Size: 1,295 bytes
Last Modified: 2025-10-06 13:57:41
<?xml version="1.0" standalone="yes"?> <Paper uid="C96-2213"> <Title>Using a Hybrid System of Corpusand Knowledge-Based Techniques to Automate the Induction of a Lexical Sublanguage Grammar</Title> <Section position="5" start_page="1165" end_page="1165" type="concl"> <SectionTitle> 6. Conclusion </SectionTitle> <Paragraph position="0"> The category space is the arbiter of paradigmatic relatedness, and since it is bootstrapped from a training corpus that is.representative for the domain sublanguage, the resulting lexical entries will be customized for that domain. Porting the lexicon to a new domain is as simple as bootstrapping another category space. Experiments with PUNDIT, a broad-coverage symbolic NLP system, have shown that the category space can successfully bc used to induce features like transitivity and subcategorization for clauses and infinitival complements.</Paragraph> <Paragraph position="1"> The advantage of combining data-driven mining with the existing lexical knowledgebase over other bootstrapping methods is that this approach does not require the manual identification of appropriate cues for subcategorization features, or the involved construction of a pattern matcher that is sophisticated enough to ignore false triggers.</Paragraph> </Section> class="xml-element"></Paper>