File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/w06-2601_abstr.xml
Size: 1,188 bytes
Last Modified: 2025-10-06 13:45:34
<?xml version="1.0" standalone="yes"?> <Paper uid="W06-2601"> <Title>Maximum Entropy Tagging with Binary and Real-Valued Features</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Recent literature on text-tagging reported successful results by applying Maximum Entropy (ME) models. In general, ME taggers rely on carefully selected binary features, which try to capture discriminant information from the training data. This paper introduces a standard setting of binary features, inspired by the literature on named-entity recognition and text chunking, and derives corresponding real-valued features based on smoothed logprobabilities. The resulting ME models have orders of magnitude fewer parameters. Effective use of training data to estimate features and parameters is achieved by integrating a leaving-one-out method into the standard ME training algorithm.</Paragraph> <Paragraph position="1"> Experimental results on two tagging tasks show statistically significant performance gains after augmenting standard binaryfeature models with real-valued features.</Paragraph> </Section> class="xml-element"></Paper>