Learning to Negotiate: Multi-Agent Deliberation for Collective Value Alignment in LLMs
arXiv:2603.10476v1 Announce Type: new Abstract: The alignment of large language models (LLMs) has progressed substantially in single-agent settings through paradigms such as RLHF and Constitutional …
Panatchakorn Anantaprayoon, Nataliia Babina, Nima Asgharbeygi, Jad Tarifi
9 views