Reinforcement Learning with Verifiable Reward

2 papers across 2 sessions

Poster Session 3

Thursday, December 4, 2025 · 11:00 AM → 2:00 PM

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

#1906 · Yang Yue, Zhiqi Chen, Rui Lu, Andrew Zhao, Zhaokai Wang, Yang Yue, Shiji Song, Gao Huang

We systematically examine the current state of RLVR and surprisingly find that it does not elicit fundamentally new reasoning patterns—revealing a gap between the potential of RL and the actual impact of current RLVR methods.

Poster Session 5

1 paper

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning

#4009 · Ruiyang Zhou, Shuozhe Li, Amy Zhang, Liu Leqi