Assistant Professor, State University of New York at Stony Brook
1 paper at NeurIPS 2025
We propose the first prior-free algorithm that achieves near-optimal dynamic regret for non-stationary multi-armed bandits under constrained feedback.