1 paper across 1 session
We show how to efficiently verify approximate optimality of smooth policies and strategies in bandits and games.