?
today
local_bar
search
Process Reward Function
1 paper across 1 session
Poster Session 6
1 paper
Friday, December 5, 2025 · 4:30 PM → 7:30 PM
Exhibit Hall C,D,E
Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs
star
#214
·
Jingyao Wang, Wenwen Qiang, Zeen Song, Changwen Zheng, Hui Xiong