Maximum Entropy Exploration Without the Rollouts
arXiv:2603.12325v1 Announce Type: cross Abstract: Efficient exploration remains a central challenge in reinforcement learning, serving as a useful pretraining objective for data collection, particularly when …
Jacob Adamczyk, Adam Kamoski, Rahul V. Kulkarni
28 views