Yu-Yang Qian
PhD student, Nanjing University
1 paper at NeurIPS 2025
Poster Session 5
1 paper
Friday, December 5, 2025 · 11:00 AM → 2:00 PM
Exhibit Hall C,D,E
Provably Efficient Online RLHF with One-Pass Reward Modeling
#413 · Long-Fei Li, Yu-Yang Qian, Peng Zhao, Zhi-Hua Zhou