Model for arXiv text summarization¶

Description: This is an AI benchmark to evaluate how well text data is summarized, using the arXiv dataset. Here we use the recall-oriented understudy for gisting evaluation (ROUGE) score as a metric.

Model benchmarks

Model name	Dataset	Rouge	Team name	Dataset size	Date submitted	Notes
transformers_t5_base	arxiv_summary	0.2602	ChemNLP	87148	01-14-2023	CSV, JSON, run.sh, Info