The CompMath-MCQ Dataset: Are LLMs Ready for Higher-Level Math?
arXiv:2603.03334v1 Announce Type: new Abstract: The evaluation of Large Language Models (LLMs) on mathematical reasoning has largely focused on elementary problems, competition-style questions, or formal …
Bianca Raimondi, Francesco Pivi, Davide Evangelista, Maurizio Gabbrielli
3 views