Senior Lecturer, Monash University
1 paper at NeurIPS 2025
We develop the unbiased Slice Wassertein RBF kernel to better measure cross-modal alignment between acoustic and linguistic modalities for audio captioning and reasoning tasks.