Retaining Suboptimal Actions to Follow Shifting Optima in Multi-Agent Reinforcement Learning
arXiv:2602.17062v1 Announce Type: new Abstract: Value decomposition is a core approach for cooperative multi-agent reinforcement learning (MARL). However, existing methods still rely on a single …
Yonghyeon Jo, Sunwoo Lee, Seungyul Han
5 views