All Articles

Articles

Academic · 1 min

Trajectory-Informed Memory Generation for Self-Improving Agent Systems

arXiv:2603.10600v1 Announce Type: new Abstract: LLM-powered agents face a persistent challenge: learning from their execution experiences to improve future performance. While agents can successfully complete …

Gaodan Fang, Vatche Isahagian, K. R. Jayaram, Ritesh Kumar, Vinod Muthusamy, Punleuk Oum, Gegi Thomas
38 views
Academic · 1 min

FERRET: Framework for Expansion Reliant Red Teaming

arXiv:2603.10010v1 Announce Type: cross Abstract: We introduce a multi-faceted automated red teaming framework in which the goal is to generate multi-modal adversarial conversations that would …

Ninareh Mehrabi, Vitor Albiero, Maya Pavlova, Joanna Bitton
29 views
Academic · 1 min

Personalized Group Relative Policy Optimization for Heterogenous Preference Alignment

arXiv:2603.10009v1 Announce Type: cross Abstract: Despite their sophisticated general-purpose capabilities, Large Language Models (LLMs) often fail to align with diverse individual preferences because standard post-training …

Jialu Wang, Heinrich Peters, Asad A. Butt, Navid Hashemi, Alireza Hashemi, Pouya M. Ghari, Joseph Hoover, James Rae, Morteza Dehghani
124 views