Skip to main content

Category

Academic

Academic · 1 min

A Unified Framework for Locality in Scalable MARL

arXiv:2602.16966v1 Announce Type: new Abstract: Scalable Multi-Agent Reinforcement Learning (MARL) is fundamentally challenged by the curse of dimensionality. A common solution is to exploit locality, …

Sourav Chakraborty, Amit Kiran Rege, Claire Monteleoni, Lijun Chen
6 views
Academic · 1 min

Fail-Closed Alignment for Large Language Models

arXiv:2602.16977v1 Announce Type: new Abstract: We identify a structural weakness in current large language model (LLM) alignment: modern refusal mechanisms are fail-open. While existing approaches …

Zachary Coalson, Beth Sohler, Aiden Gabriel, Sanghyun Hong
15 views
Academic · 1 min

Malliavin Calculus as Stochastic Backpropogation

arXiv:2602.17013v1 Announce Type: new Abstract: We establish a rigorous connection between pathwise (reparameterization) and score-function (Malliavin) gradient estimators by showing that both arise from the …

Kevin D. Oden
5 views