today local_bar

Mingyu Chen

PhD student, Boston University, Boston University

2 papers at NeurIPS 2025

OpenReview· Semantic Scholar· Google Scholar

Poster Session 1

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Accelerating RL for LLM Reasoning with Optimal Advantage Regression

#513 · Kianté Brantley, Mingyu Chen, Zhaolin Gao, Jason D. Lee, Wen Sun, Wenhao Zhan, Xuezhou Zhang

Poster Session 2

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Avoiding exp(R) scaling in RLHF through Preference-based Exploration

#3318 · Mingyu Chen, Yiding Chen, Wen Sun, Xuezhou Zhang

We introduce a new online RLHF algorithm that for the first time achieves a sample complexity that scales polynomially with the reward scale.