?
today
local_bar
search
post training
2 papers across 1 session
Poster Session 2
2 papers
Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM
Exhibit Hall C,D,E
Better Estimation of the Kullback--Leibler Divergence Between Language Models
star
#1902
·
Afra Amini, Tim Vieira, Ryan Cotterell
GVPO: Group Variance Policy Optimization for Large Language Model Post-Training
star
#3415
·
Kaichen Zhang, Yuzhong Hong, Junwei Bao, Hongfei Jiang, Yang Song, Hong Dingqian, Hui Xiong