Skip to content

Overview - Million Query 2007

Proceedings | Data | Results | Runs | Participants

The goal of this track is to run a retrieval task similar to standard ad-hoc retrieval, but to evaluate large numbers of queries incompletely, rather than a small number more completely. Participants will run 10,000 queries and a random 1,000 or so will be evaluated. The corpus is the terabyte track's GOV2 corpus of roughly 25,000,000 .gov web pages, amounting to just under half a terabyte of data.

Track coordinator(s):

  • J. Allan, University of Massachusetts Amherst
  • B. Carterette, University of Massachusetts Amherst
  • B. Dachev, University of Massachusetts Amherst
  • J. A. Aslam, Northeastern University
  • V. Pavlu, Northeastern University
  • E. Kanoulas, Northeastern University

Tasks:

  • 01: Task 1
  • 10: Task 2

Track Web Page: https://web.archive.org/web/20090311232726/http://ciir.cs.umass.edu/research/million/