Skip to content

Overview - Million LLMs Track (MLLM) 2025

Proceedings | Data | Runs | Participants

The Million LLMs Track introduces a novel challenge: ranking large language models (LLMs) based on their expected ability to answer specific user queries. As organizations deploy ensembles of LLMs—ranging from general-purpose to domain-specific it becomes crucial to determine which models to consult for a given task. This track focuses on evaluating systems that can effectively identify the most capable LLM(s) for a query, without issuing new queries to the models.

Track coordinator(s):

  • Evangelos Kanoulas, University of Amsterdam
  • Panagiotis Eustratiadis, University of Amsterdam
  • Mark Sanderson, RMIT University
  • Jamie Callan, Carnegie Mellon University

Tasks:

  • trec2025-mllm-main: Main task

Track Web Page: https://trec-mllm.github.io/