Junpei Komiyama

Assistant Professor, Mohamed bin Zayed University of Artificial Intelligence

1 paper at NeurIPS 2025

Poster Session 5

We propose an algorithm for estimating the best mean reward in a multi-armed bandit with asymptotically optimal, instance-adaptive sample complexity.