Balancing Multiple Objectives in Urban Traffic Control with Reinforcement Learning from AI Feedback
arXiv:2602.20728v1 Announce Type: new Abstract: Reward design has been one of the central challenges for real world reinforcement learning (RL) deployment, especially in settings with …
Chenyang Zhao, Vinny Cahill, Ivana Dusparic
10 views