Full Professor, University of Tübingen
2 papers at NeurIPS 2025
We propose an efficient strategy for adversarial finetuning of the CLIP text encoder, enabling robustness in zero-shot classification, text-to-image retrieval and text-to-image generation.
A fine-tuning method for Compositionally-aware Learning in CLIP