PhD student, University of Texas at Austin
2 papers at NeurIPS 2025
Debugging today's chain-of-thought reasoning for video understanding and introducing a novel reward for MLLM post-training to ground video reasoning in visual evidence.
Finding that today's LMMs poorly grasp the arrow of time in video, we propose ArrowRL to enhance their temporal perception and AoTBench for rigorous evaluation.