
Engineering a multi-purpose test collection for Web retrieval experiments

Bailey, Peter; Craswell, Nick; Hawking, David


Past research into text retrieval methods for the Web has been restricted by the lack of a test collection capable of supporting experiments which are both realistic and reproducible. The 1.69 million document WT10g collection is proposed as a multi-purpose testbed for experiments with these attributes, in distributed IR, hyperlink algorithms and conventional ad hoc retrieval. WT10g was constructed by selecting from a superset of documents in such a way that desirable corpus properties were...

Collections: ANU Research Publications
Date published: 2003
Type: Journal article
Source: Information Processing and Management
DOI: 10.1016/S0306-4573(02)00084-5


