File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/94/a94-1011_concl.xml

Size: 2,249 bytes

Last Modified: 2025-10-06 13:57:08

<?xml version="1.0" standalone="yes"?>
<Paper uid="A94-1011">
  <Title>ing, Word Associations and Typical Predicate-Argument</Title>
  <Section position="9" start_page="69" end_page="69" type="concl">
    <SectionTitle>
7 Conclusion
</SectionTitle>
    <Paragraph position="0"> The significant theoretical result is that as the sophistication of the representation of abstracts is increased, the performance of the single term model improves, while the performance of the term weighting models does not improve significantly. This has been a fairly universal experience among researchers working within the term weighting classification paradigm.</Paragraph>
    <Paragraph position="1"> Although there is a very marginally significant improvement from using linguistically sophisticated representations over simple sequence representations if all of the sequences are represented, this largely (though not entirely) disappears when only most specific sequences are considered, so it might be a result of the effects discussed in section 3.2.</Paragraph>
    <Paragraph position="2"> The rule based assignment strategy exploits the Single Term model's estimates, and also performs much better on word sequence representations than on word set representations. This assignment strategy is promising because it can exploit more sophisticated representations well, has a sound theory behind it, and will assign descriptors only where it has enough information to do so. Some of the descriptors in the RAPRA corpus, for example, are only ever assigned from the entire article from which the abstract is taken, so no assignment strategy will ever do well on these. On the other hand this model also shows promise that IR techniques might be applied to help infer linguistic resources such as term banks from large classified corpora.</Paragraph>
    <Paragraph position="3"> The next stage is to add more sophisticated linguistic annotation to corpora, and to trawl for rules in boolean combinations of descriptors, thus addressing the results of section 3.2. In this way this work can be considered similar in spirit to that undertaken by Apte et al (1994), but differs in the forms of representation which are being considered for documents.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML