File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/n06-1050_abstr.xml
Size: 1,175 bytes
Last Modified: 2025-10-06 13:44:54
<?xml version="1.0" standalone="yes"?> <Paper uid="N06-1050"> <Title>Creating a Test Collection for Citation-based IR Experiments</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We present an approach to building a test collection of research papers. The approach is based on the Cranfield 2 tests but uses as its vehicle a current conference; research questions and relevance judgements of all cited papers are elicited from conference authors. The resultant test collection is different from TREC's in that it comprises scientific articles rather than newspaper text and, thus, allows for IR experiments that include citation information. The test collection currently consists of 170 queries with relevance judgements; the document collection is the ACL Anthology. We describe properties of our queries and relevance judgements, and demonstrate the use of the test collection in an experimental setup. One potentially problematic property of our collection is that queries have a low number of relevant documents; we discuss ways of alleviating this.</Paragraph> </Section> </Paper>