Skip to content

Overview - Terabyte 2004

Proceedings | Data | Results | Runs | Participants

The goal of the Terabyte Track is to develop an evaluation methodology for terabyte-scale document collections. This year's track uses a 426GB collection of Web data from the .gov domain. While this collection is less than a full terabyte in size, it is considerably larger than the collections used in previous TREC tracks. In future years, we plan to expand the collection using data from other sources.

Track coordinator(s):

  • C. Clarke, University of Waterloo
  • N. Craswell, Microsoft Research
  • I. Soboroff, National Institute of Standards and Technology (NIST)

Track Web Page: https://trec.nist.gov/data/terabyte/04/04.guidelines.html