LLM Reasoning; Adaptive Thinking; Reinforcement Learning
1 paper across 1 session
Poster Session 6
1 paper
Friday, December 5, 2025 · 4:30 PM → 7:30 PM
Exhibit Hall C,D,E
Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL
#5003
Songjun Tu, Jiahao Lin, Qichao Zhang, Xiangyu Tian, Linjing Li, Xiangyuan Lan, Dongbin Zhao