Overview - Terabyte 2005

The primary goal of the Terabyte Track is to develop an evaluation methodology for terabyte-scale document collections. We are also interested in efficiency and scalability issues, which are easier to study in the context of a larger collection. As in the previous year, all tasks use a 426 GB collection of Web data from the .gov domain. Although this collection is smaller than a full terabyte, it is considerably larger than the collections used in previous TREC tracks. In future years, we hope to expand the collection with data from other sources.

Track coordinator(s):

  • C.L.A. Clarke, University of Waterloo
  • F. Scholer, Royal Melbourne Institute of Technology (RMIT University)
  • I. Soboroff, National Institute of Standards and Technology (NIST)

Tasks:

  • adhoc: Adhoc retrieval
  • namedpage: Named page finding
  • efficiency: Efficiency

Track Web Page: https://plg.uwaterloo.ca/~claclark/TB05.html