contextual bandit

4 papers across 3 sessions

Poster Session 1

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

True Impact of Cascade Length in Contextual Cascading Bandits

#3208 · Hyun-jun Choi, Joongkyu Lee, Min-hwan Oh

We show that in contextual cascading bandits, regret vanishes as the cascade length grows, with nearly matching upper and lower bounds.

Poster Session 3

1 paper

Thursday, December 4, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Tractable Multinomial Logit Contextual Bandits with Non-Linear Utilities

#3102 · Taehyun Hwang, Dahngoon Kim, Min-hwan Oh

We propose a computationally tractable multinomial logit contextual bandit algorithm that handles generic non-linear parametric utility functions.

Poster Session 5

2 papers

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Test-Time Scaling of Diffusion Models via Noise Trajectory Search

#611 · Vignav Ramesh, Morteza Mardani

We present an algorithm for test-time scaling of SDE-based diffusion models by searching for noise trajectories that optimize arbitrary rewards, empirically matching or exceeding MCTS performance.

Provably Efficient Online RLHF with One-Pass Reward Modeling

#413 · Long-Fei Li, Yu-Yang Qian, Peng Zhao, Zhi-Hua Zhou