Mean Flow Policy with Instantaneous Velocity Constraint for One-step Action Generation
arXiv:2602.13810v1 Announce Type: new Abstract: Learning expressive and efficient policy functions is a promising direction in reinforcement learning (RL). While flow-based policies have recently proven …
Guojian Zhan, Letian Tao, Pengcheng Wang, Yixiao Wang, Yiheng Li, Yuxin Chen, Masayoshi Tomizuka, Shengbo Eben Li
5 views