Overview - Million LLMs Track (MLLM) 2025
The Million LLMs Track introduces a novel challenge: ranking large language models (LLMs) by their expected ability to answer specific user queries. As organizations deploy ensembles of LLMs, ranging from general-purpose to domain-specific, it becomes crucial to determine which models to consult for a given task. This track evaluates systems that can identify the most capable LLM(s) for a query without issuing new queries to the models.
Track coordinator(s):
- Evangelos Kanoulas, University of Amsterdam
- Panagiotis Eustratiadis, University of Amsterdam
- Mark Sanderson, RMIT University
- Jamie Callan, Carnegie Mellon University
Tasks:
- trec2025-mllm-main: Main task
Track Web Page: https://trec-mllm.github.io/