PhD student, École de technologie supérieure
1 paper at NeurIPS 2025
We show that interval estimation based methods produce better distilled embedders in multi-teacher distillation settings compared to MSE or Cosine base methods.