File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/03/w03-1104_concl.xml
Size: 1,567 bytes
Last Modified: 2025-10-06 13:53:46
<?xml version="1.0" standalone="yes"?> <Paper uid="W03-1104"> <Title>A Differential LSI Method for Document Classification</Title> <Section position="4" start_page="0" end_page="0" type="concl"> <SectionTitle> 4 Conclusion and Remarks </SectionTitle> <Paragraph position="0"> We have made use of the differential vectors of two normalized vectors rather than the mere scalar cosine of the angle of the two vectors in document classification procedure, providing a more effective means of document classifier. Obviously the concept of differential intra- and extra-document vectors imbeds a richer meaning than the mere scalar measure of cosine, focussing the characteristics of each document wheere the new classifier demonstrates an improved and robust performance in document classification than the LSI-based cosine approach. Our model considers both of the projections and the distances of the differential vectors to the DLSI spaces, improving the adaptability of the conventional LSI-based method to the unique characteristics of the individual documents which is a common weakness of the global projection schemes including the LSI. The simple experiment demonstrates convincingly that the performance of our model outperforms the standard LSI space-based approach. Just as the cross-language ability of LSI, DLSI method should also be able to be used for document classification of docuements in multiple languages. We have tested our method using larger collection of texts, we will give details of the results elsewhere. .</Paragraph> </Section> class="xml-element"></Paper>