Model for arXiv text summarization¶
- Description: This is an AI benchmark to evaluate how well text data is summarized, using the arXiv dataset. Here we use the recall-oriented understudy for gisting evaluation (ROUGE) score as a metric.
Reference(s): https://doi.org/10.48550/arXiv.1905.00075, https://github.com/usnistgov/chemnlp
Model benchmarks
Model name | Dataset | Rouge | Team name | Dataset size | Date submitted | Notes |
---|---|---|---|---|---|---|
transformers_t5_base | arxiv_summary | 0.2602 | ChemNLP | 87148 | 01-14-2023 | CSV, JSON, run.sh, Info |