Assistant Professor, Virginia Polytechnic Institute and State University
1 paper at NeurIPS 2025
In this paper, we study how the distribution of preference data influences Direct Preference Optimization (DPO) from both theoretical and empirical perspectives.