Overview - BioGen 2025¶
Proceedings | Data | Runs | Participants
With the advancement of large language models (LLMs), the biomedical domain has seen significant progress and improvement in multiple tasks such as biomedical question answering, lay language summarization of the biomedical literature, clinical note summation, etc. However, hallucinations or confabulations remain one of the key challenges when using LLMs in the biomedical domain. Inaccuracies may be particularly harmful in high-risk situations, such as making clinical decisions or appraising biomedical research. Towards this, in our pilot task organized at TREC 2024, we introduced the task of reference attribution as a means to mitigate the generation of false statements by LLMs toward answering the biomedical question. We propose to continue this task with an additional task of grounding the answer in the BioGen track at TREC 2025. The goal of the TREC 2025 BioGen task will be to cite references to support the text of the sentences and the overall answer from LLM output for each topic.
Track coordinator(s):
- Deepak Gupta - National Library of Medicine, NIH
- Dina Demner-Fushman - National Library of Medicine, NIH
- Bill Hersh - Oregon Health & Science University
- Steven Bedrick - Oregon Health & Science University
- Kirk Roberts - UTHealth Houston
Tasks:
trec2025-biogen-task-a: Grounding answerstrec2025-biogen-task-b: Reference attribution
Track Web Page: https://trec-biogen.github.io/