Greedy Sampling

1 paper across 1 session

Poster Session 2

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

Greedy Sampling Is Provably Efficient For RLHF

#3313 · Di Wu, Chengshuai Shi, Jing Yang, Cong Shen

This work shows that greedy sampling based on empirical estimates is provably efficient for RLHF, under both the general preference model and the Bradley-Terry model.