File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/99/p99-1045_abstr.xml

Size: 650 bytes

Last Modified: 2025-10-06 13:49:48

<?xml version="1.0" standalone="yes"?>
<Paper uid="P99-1045">
  <Title>Less is more: Eliminating index terms from subordinate clauses</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We perform a linguistic analysis of documents during indexing for information retrieval. Eliminating index terms that occur only in subordinate clauses reduces index size by approximately 30% without adversely affecting precision or recall. These results hold for two corpora: a sample of the World Wide Web and an electronic encyclopedia.</Paragraph>
  </Section>
</Paper>