1 paper across 1 session
A novel 3D audio-visual QA benchmark and a training-free spatial reasoning pipeline for Audio-Visual LLMs