Self-play Optimization
1 paper across 1 session
Poster Session 5
1 paper
Friday, December 5, 2025 · 11:00 AM → 2:00 PM
Exhibit Hall C,D,E
Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs
#3608
Yibo Wang, Hai-Long Sun, Guangda Huzhang, Qingguo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, Lijun Zhang