PhD student, University of Warsaw
1 paper at NeurIPS 2025
We introduce FIxLIP, faithful interaction explanations of CLIP and SigLIP models, for interpreting image–text similarity predictions.