Jan Christian Blaise Cruz, Alham Fikri Aji

Articles by Jan Christian Blaise Cruz, Alham Fikri Aji

Academic · 1 min

LLM Olympiad: Why Model Evaluation Needs a Sealed Exam

arXiv:2603.23292v1. Abstract: Benchmarks and leaderboards are how NLP most often communicates progress, but in the LLM era they are increasingly easy to …

Mar 25
