Overview - Million LLMs Track (MLLM) 2025


The Million LLMs Track introduces a novel challenge: ranking large language models (LLMs) based on their expected ability to answer specific user queries. As organizations deploy ensembles of LLMs, ranging from general-purpose to domain-specific, it becomes crucial to determine which models to consult for a given task. This track focuses on evaluating systems that can effectively identify the most capable LLM(s) for a query, without issuing new queries to the models.
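To make the ranking task concrete, here is a minimal sketch of one possible approach. It is not the track's method or data format. It assumes, purely for illustration, that each model comes with a log of past queries and a quality score in [0, 1] for its answer to each; the model ids, the logs, and the `rank_models` function are all hypothetical. Models are ranked by a similarity-weighted average of their past quality, so no model is queried at ranking time.

```python
from collections import Counter
import math


def tokenize(text):
    """Lowercase whitespace tokenization; a deliberately simple stand-in."""
    return text.lower().split()


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_models(query, history):
    """Rank model ids by similarity-weighted past quality.

    `history` maps a model id to a list of (past_query, quality) pairs,
    where quality is a hypothetical score in [0, 1]. No model is called.
    """
    q_vec = Counter(tokenize(query))
    scores = {}
    for model_id, log in history.items():
        weighted = 0.0
        total_sim = 0.0
        for past_query, quality in log:
            sim = cosine(q_vec, Counter(tokenize(past_query)))
            weighted += sim * quality
            total_sim += sim
        # Models with no similar past query fall back to a score of 0.
        scores[model_id] = weighted / total_sim if total_sim > 0 else 0.0
    # Highest score first; ties are broken alphabetically by model id.
    return sorted(scores, key=lambda m: (-scores[m], m))


# Hypothetical logs, invented purely for illustration.
history = {
    "general-llm": [("capital of france", 0.9), ("dosage of ibuprofen", 0.4)],
    "medical-llm": [("dosage of ibuprofen", 0.95), ("capital of france", 0.3)],
}
ranking = rank_models("safe dosage of aspirin", history)
```

In this toy setup the medical model ranks first for the dosage query, because the past queries most similar to it are the ones that model answered well. A real system would likely replace the bag-of-words similarity with learned representations.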

Track coordinator(s):

Evangelos Kanoulas, University of Amsterdam
Panagiotis Eustratiadis, University of Amsterdam
Mark Sanderson, RMIT University
Jamie Callan, Carnegie Mellon University

Tasks:

trec2025-mllm-main: Main task

Track Web Page: https://trec-mllm.github.io/