PhD student, School of Computer and Communication Sciences, EPFL - EPF Lausanne
1 paper at NeurIPS 2025
Aligning pretrained unimodal models with the proposed framework, using only limited paired data, yields gains of ~52% in cross-modality zero-shot classification and ~92% in retrieval.