PhD student, University of Sydney
1 paper at NeurIPS 2025
In this paper, we study how the distribution of preference data influences Direct Preference Optimization (DPO), from both theoretical and empirical perspectives.