?
today
local_bar
search
value-based reinforcement learning
1 paper across 1 session
Poster Session 6
1 paper
Friday, December 5, 2025 · 4:30 PM → 7:30 PM
Exhibit Hall C,D,E
Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning
star
#215
·
Yurun Yuan, Fan Chen, Zeyu Jia, Alexander Rakhlin, Tengyang Xie