Full Professor, sorbonne université
2 papers at NeurIPS 2025
We propose a novel architecture and training objective specifically designed to upsample features from foundation vision encoders at any resolution.
We propose CLIPTTA, a contrastive test-time adaptation method for CLIP that improves both accuracy and OOD detection in closed- and open-set settings.