Bogdan Kostić, Conor Fallon, Julian Risch, Alexander Löser

Articles by Bogdan Kostić, Conor Fallon, Julian Risch, Alexander Löser

Academic · 1 min

Same Meaning, Different Scores: Lexical and Syntactic Sensitivity in LLM Evaluation

arXiv:2602.17316v1. Abstract: The rapid advancement of Large Language Models (LLMs) has established standardized evaluation benchmarks as the primary instrument for model comparison. …

8 views Feb 21

JCG, PC

HSOLLC Co., Ltd.