Skip to main content

All Articles

Articles

Academic · 1 min

Logit Distance Bounds Representational Similarity

arXiv:2602.15438v1 Announce Type: new Abstract: For a broad family of discriminative models that includes autoregressive language models, identifiability results imply that if two models induce …

Beatrix M. B. Nielsen, Emanuele Marconato, Luigi Gresele, Andrea Dittadi, Simon Buchholz
5 views
Academic · 1 min

Benchmarking IoT Time-Series AD with Event-Level Augmentations

arXiv:2602.15457v1 Announce Type: new Abstract: Anomaly detection (AD) for safety-critical IoT time series should be judged at the event level: reliability and earliness under realistic …

Dmitry Zhevnenko, Ilya Makarov, Aleksandr Kovalenko, Fedor Meshchaninov, Anton Kozhukhov, Vladislav Travnikov, Makar Ippolitov, Kirill Yashunin, Iurii Katser
7 views
Academic · 1 min

POP: Prior-fitted Optimizer Policies

arXiv:2602.15473v1 Announce Type: new Abstract: Optimization refers to the task of finding extrema of an objective function. Classical gradient-based optimizers are highly sensitive to hyperparameter …

Jan Kobiolka, Christian Frey, Gresa Shala, Arlind Kadra, Erind Bedalli, Josif Grabocka
4 views
Academic · 1 min

LLM-as-Judge on a Budget

arXiv:2602.15481v1 Announce Type: new Abstract: LLM-as-a-judge has emerged as a cornerstone technique for evaluating large language models by leveraging LLM reasoning to score prompt-response pairs. …

Aadirupa Saha, Aniket Wagde, Branislav Kveton
5 views
Academic · 1 min

Approximation Theory for Lipschitz Continuous Transformers

arXiv:2602.15503v1 Announce Type: new Abstract: Stability and robustness are critical for deploying Transformers in safety-sensitive settings. A principled way to enforce such behavior is to …

Takashi Furuya, Davide Murari, Carola-Bibiane Sch\"onlieb
7 views