Hierarchical Reward Design from Language: Enhancing Alignment of Agent Behavior with Human Specifications
arXiv:2602.18582v1 Announce Type: new Abstract: When training artificial intelligence (AI) to perform tasks, humans often care not only about whether a task is completed but …
Zhiqin Qian, Ryan Diaz, Sangwon Seo, Vaibhav Unhelkar
3 views