Poster Session 4 · Thursday, December 4, 2025 4:30 PM → 7:30 PM
#406
Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms
Abstract
We consider a stochastic multi-armed bandit problem with i.i.d. rewards where the expected reward function is multimodal with at most modes.
We propose the first known computationally tractable algorithm for computing the solution to the Graves-Lai optimization problem, which in turn enables the implementation of asymptotically optimal algorithms for this bandit problem.