?
today
local_bar
search
Process-Supervision
1 paper across 1 session
Poster Session 3
1 paper
Thursday, December 4, 2025 · 11:00 AM → 2:00 PM
Exhibit Hall C,D,E
Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning
star
#305
·
Wenlin Zhang, Xiangyang Li, Kuicai Dong, Yichao Wang, Pengyue Jia, Xiaopeng Li, Yingyi Zhang, Derong Xu, Zhaocheng Du, Huifeng Guo, Ruiming Tang, Xiangyu Zhao