dueling bandit - NeurIPS 2025

today local_bar

dueling bandit

2 papers across 2 sessions

Poster Session 4

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options

#3310 · Joongkyu Lee, Seouh-won Yi, Min-hwan Oh

We present the first theoretical analysis of PbRL with ranking feedback, showing that longer ranking feedback can provably improve sample efficiency.

Poster Session 5

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Learning Across the Gap: Hybrid Multi-armed Bandits with Heterogeneous Offline and Online Data

#3102 · Qijia He, Minghan Wang, Xutong Liu, Zhiyong Wang, Fang Kong