Full Professor, KU Leuven
1 paper at NeurIPS 2025
This paper introduces DAVE, a diagnostic benchmark that requires both audio and visual inputs and separates evaluation into subcategories to reveal specific failure modes in audio-visual models.