Full Professor, University of Trento
3 papers at NeurIPS 2025
We introduce the task of Concept-based Video Similarity estimation, curate a dedicated benchmark and evaluate models' performance.
LT-Soups merges CLIP models fine-tuned on balanced subsets and retrains the classifier on the full dataset, achieving SOTA head/tail accuracy trade-offs across five benchmarks.