Academic · 1 min

Weight space Detection of Backdoors in LoRA Adapters

arXiv:2602.15195v1 Announce Type: cross Abstract: LoRA adapters let users fine-tune large language models (LLMs) efficiently. However, LoRA adapters are shared through open repositories like Hugging …

David Puertolas Merenciano, Ekaterina Vasyagina, Raghav Dixit, Kevin Zhu, Ruizhe Li, Javier Ferrando, Maheep Chaudhary
5 views
Academic · 1 min

Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems

arXiv:2602.15198v1 Announce Type: cross Abstract: Multi-agent systems, where LLM agents communicate through free-form language, enable sophisticated coordination for solving complex cooperative tasks. This surfaces a …

Mason Nakamura, Abhinav Kumar, Saswat Das, Sahar Abdelnabi, Saaduddin Mahmud, Ferdinando Fioretto, Shlomo Zilberstein, Eugene Bagdasarian
9 views
Academic · 1 min

How to Train Your Long-Context Visual Document Model

arXiv:2602.15257v1 Announce Type: cross Abstract: We present the first comprehensive, large-scale study of training long-context vision language models up to 344K context, targeting long-document visual …

Austin Veselka
6 views
Academic · 1 min

The Information Geometry of Softmax: Probing and Steering

arXiv:2602.15293v1 Announce Type: cross Abstract: This paper concerns the question of how AI systems encode semantic structure into the geometric structure of their representation spaces. …

Kiho Park, Todd Nief, Yo Joong Choe, Victor Veitch
5 views
Academic · 1 min

Near-Optimal Sample Complexity for Online Constrained MDPs

arXiv:2602.15076v1 Announce Type: new Abstract: Safety is a fundamental challenge in reinforcement learning (RL), particularly in real-world applications such as autonomous driving, robotics, and healthcare. …

Chang Liu, Yunfan Li, Lin F. Yang
3 views