File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/97/a97-1044_concl.xml
Size: 2,257 bytes
Last Modified: 2025-10-06 13:57:46
<?xml version="1.0" standalone="yes"?> <Paper uid="A97-1044"> <Title>Building Effective Queries In Natural Language Information Retrieval</Title> <Section position="7" start_page="305" end_page="305" type="concl"> <SectionTitle> 7. CONCLUSIONS </SectionTitle> <Paragraph position="0"> We presented in some detail our natural language information retrieval system consisting of an advanced NLP module and a &quot;pure' statistical core engine. While many problems remain to be resolved, including the question of adequacy of term-based representation of document content, we attempted to demonstrate that the architecture described here is nonetheless viable. In particular, we demonstrated that natural language processing can now be done on a fairly large scale and that its speed and robustness have improved to the point where it can be applied to real IR problems.</Paragraph> <Paragraph position="1"> The main observation to make is that natural language processing is not as effective as we have once hoped to obtain better indexing and better term representations of queries. Using linguistic terms, such as phrases, head-modifier pairs, names, or even simple concepts does help to improve retrieval precision, but the gains remained quite modest. On the other hand, full text query expansion works remarkably well. Our main effort in the immediate future will be to explore ways to achieve at least partial automation of this process. An initial experiment in this direction has been performed as part of NLP Track (genlp3 run), and the results are encouraging.</Paragraph> <Paragraph position="2"> ACKNOWLEDGEMENTS. We would like to thank Donna Harman of NIST for making her PRISE system available to us since the beginning of TREC. Will Rogers and Paul Over provided valuable assistance in installing updated versions of PRISE. We would also like to thank Ralph Weischedel for providing the BBN's part of speech tagger. We acknowledge the following members of our TREC-5 team who participated in the query expansion experiments: Louise Guthrie, Jussi Karlgren, Jim Leistensnider, Troy Straszheim, and Jon Wilding. This paper is based upon work supported in part by the Advanced Research Projects Agency under</Paragraph> </Section> class="xml-element"></Paper>