Full Professor, Monash University
4 papers at NeurIPS 2025
We develop the unbiased Slice Wassertein RBF kernel to better measure cross-modal alignment between acoustic and linguistic modalities for audio captioning and reasoning tasks.
We introduce the first graph foundation model specifically designed for retrieval-augmented generation in large language models.