Full Professor, Institute of Computing Technology
4 papers at NeurIPS 2025
This work improves CLIP’s visual detail capturing ability by inverting the unCLIP generative model, which we find suitable for achieving this goal.
We introduce LogitGap, a training-free and post-hoc OOD detector that achieves state-of-the-art performance for both vision-language and vision-only models.