This platform requires JavaScript for full functionality. Please enable JavaScript in your browser settings.

Orin Levy, Yishay Mansour

Articles by Orin Levy, Yishay Mansour

Academic · 1 min

Optimal Regret for Policy Optimization in Contextual Bandits

arXiv:2602.13700v1 Announce Type: new Abstract: We present the first high-probability optimal regret bound for a policy optimization technique applied to the problem of stochastic contextual …

4 views Feb 18

Something extraordinary is coming.

Orin Levy, Yishay Mansour

Articles by Orin Levy, Yishay Mansour

Optimal Regret for Policy Optimization in Contextual Bandits

JCG, PC

HSOLLC Co., Ltd.