PhD student, Vector Institute
1 paper at NeurIPS 2025
Introduced Refined Regularized Preference Optimization, together with a self-alignment framework, which enables fine-grained alignment of large video language models by having them learn from their own errors.