?
today
local_bar
search
Self-Instruction;Self-Rewarding
1 paper across 1 session
Poster Session 2
1 paper
Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM
Exhibit Hall C,D,E
SeRL: Self-play Reinforcement Learning for Large Language Models with Limited Data
star
#411
·
Wenkai Fang, Shunyu Liu, Yang Zhou, Kongcheng Zhang, Tongya Zheng, Kaixuan Chen, Mingli Song, Dacheng Tao