PhD student, National University of Singapore
4 papers at NeurIPS 2025
Talk2Event is a new benchmark for attribute-aware visual grounding from event cameras.
We present 3EED, the first large-scale benchmark for 3D visual grounding across vehicles, drones, and quadrupeds, with over 134K 3D objects and 25K human-verified expressions in diverse outdoor scenes.
Spiral is a new type of LiDAR generation model that enables semantic awareness and progressive diffusion.