PhD student, University of Basel
1 paper at NeurIPS 2025
Aligning pretrained unimodal models with the proposed framework using limited paired data yields ~52% gains in cross-modality zero-shot classification and ~92% in retrieval.