PhD student, Imperial College London
1 paper at NeurIPS 2025
In this paper, we study how the distribution of preference data influences Direct Preference Optimization (DPO), from both theoretical and empirical perspectives.