File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/w05-0608_abstr.xml

Size: 1,025 bytes

Last Modified: 2025-10-06 13:44:37

<?xml version="1.0" standalone="yes"?>
<Paper uid="W05-0608">
  <Title>Domain Kernels for Text Categorization</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> In this paper we propose and evaluate a technique to perform semi-supervised learning for Text Categorization. In particular we de ned a kernel function, namely the Domain Kernel, that allowed us to plug external knowledge into the supervised learning process. External knowledge is acquired from unlabeled data in a totally unsupervised way, and it is represented by means of Domain Models. null We evaluated the Domain Kernel in two standard benchmarks for Text Categorization with good results, and we compared its performance with a kernel function that exploits a standard bag-of-words feature representation. The learning curves show that the Domain Kernel allows us to reduce drastically the amount of training data required for learning.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML