Learning to maintain safety through expert demonstrations in settings with unknown constraints: A Q-learning perspective
arXiv:2602.23816v1 Announce Type: new Abstract: Given a set of trajectories demonstrating the execution of a task safely in a constrained MDP with observable rewards but …
George Papadopoulos, George A. Vouros
12 views