PhD student, Institute of Computer Science, Ludwig-Maximilians-Universität München
1 paper at NeurIPS 2025
We introduce faithful interaction explanations of CLIP and SigLIP models (FIxLIP), offering a unique perspective on interpreting image–text similarity predictions.