MS student, Indian Institute of Science, Bangalore
1 paper at NeurIPS 2025
We offer statistically robust methods for preference learning that leverage response time in the estimation of rewards to yield large improvements in statistical efficiency.