Research Professor, School of Computer Science, Carnegie Mellon University
1 paper at NeurIPS 2025
A small audio-language model for audio reasoning that achieves state-of-the-art performance with 50 times fewer parameters and 60 times fewer audio hours.