pure exploration

2 papers across 2 sessions

Poster Session 5

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Optimal Estimation of the Best Mean in Multi-Armed Bandits

#3309 · Takayuki Osogami, Junya Honda, Junpei Komiyama

We propose an algorithm for estimating the best mean reward in a multi-armed bandit with asymptotically optimal, instance-adaptive sample complexity.

Poster Session 6

1 paper

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

FraPPE: Fast and Efficient Preference-Based Pure Exploration

#3202 · Udvas Das, Apurv Shukla, Debabrota Basu

A computationally efficient algorithm for identifying the exact Pareto optimal set with fixed confidence and any preference cone in a vector-valued Bandit. FraPPE is provably asymptotically optimal and numerically achieves the least sample complexity