?
today
local_bar
search
RL post-training
1 paper across 1 session
Poster Session 1
1 paper
Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM
Exhibit Hall C,D,E
RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning
star
#4017
·
Kaiwen Zha, Zhengqi Gao, Maohao Shen, Zhang-Wei Hong, Duane Boning, Dina Katabi