COMPASS-Hedge: Learning Safely Without Knowing the World
arXiv:2603.22348v1 Announce Type: new Abstract: Online learning algorithms often faces a fundamental trilemma: balancing regret guarantees between adversarial and stochastic settings and providing baseline safety …
Ting Hu, Luanda Cai, Manolis Vlatakis
4 views