Associate Professor, University of Amsterdam
2 papers at NeurIPS 2025
We present an efficient algorithm for linear contextual bandits with adversarial losses and stochastic action sets.
We describe a framework that provides theoretical guarantees on the correctness of learning concepts from data and on the number of required labels.