Skip to main content

Category

Academic

Academic · 1 min

POP: Prior-fitted Optimizer Policies

arXiv:2602.15473v1 Announce Type: new Abstract: Optimization refers to the task of finding extrema of an objective function. Classical gradient-based optimizers are highly sensitive to hyperparameter …

Jan Kobiolka, Christian Frey, Gresa Shala, Arlind Kadra, Erind Bedalli, Josif Grabocka
3 views
Academic · 1 min

LLM-as-Judge on a Budget

arXiv:2602.15481v1 Announce Type: new Abstract: LLM-as-a-judge has emerged as a cornerstone technique for evaluating large language models by leveraging LLM reasoning to score prompt-response pairs. …

Aadirupa Saha, Aniket Wagde, Branislav Kveton
4 views
Academic · 1 min

Approximation Theory for Lipschitz Continuous Transformers

arXiv:2602.15503v1 Announce Type: new Abstract: Stability and robustness are critical for deploying Transformers in safety-sensitive settings. A principled way to enforce such behavior is to …

Takashi Furuya, Davide Murari, Carola-Bibiane Sch\"onlieb
6 views