Skip to content

Model for arXiv text class

  • Description: This is an AI benchmark to evaluate how accurately text data is classified into different categories, using the arXiv dataset. Here we use accuracy of classification (ACC) to compare how well each model classifies the text data, comparing to the ground truth classification of the arXiv categories.


Reference(s): https://github.com/usnistgov/chemnlp, https://doi.org/10.48550/arXiv.1905.00075

Model benchmarks

Model nameDataset Accuracy Team name Dataset size Date submitted Notes
logisticreg_model_text_title_abstractarXiv0.8597ChemNLP10099401-14-2023CSV, JSON, run.sh, Info
random_forest_text_abstractarXiv0.884ChemNLP10099401-14-2023CSV, JSON, run.sh, Info
svc_model_text_abstractarXiv0.9082ChemNLP10099401-14-2023CSV, JSON, run.sh, Info
logisticreg_model_text_abstractarXiv0.8543ChemNLP10099401-14-2023CSV, JSON, run.sh, Info
svc_model_text_title_abstractarXiv0.9082ChemNLP10099401-14-2023CSV, JSON, run.sh, Info
random_forest_text_title_abstractarXiv0.8854ChemNLP10099401-14-2023CSV, JSON, run.sh, Info
logisticreg_model_text_titlearXiv0.7903ChemNLP10099401-14-2023CSV, JSON, run.sh, Info
svc_model_text_titlearXiv0.8469ChemNLP10099401-14-2023CSV, JSON, run.sh, Info
random_forest_text_titlearXiv0.8681ChemNLP10099401-14-2023CSV, JSON, run.sh, Info