Assistant Professor, University of North Carolina at Chapel Hill
2 papers at NeurIPS 2025
We introduce ExAct, a benchmark for evaluating video-language models on expert-level understanding of fine-grained physical human activities across diverse real-world domains.
ReAgent-V enables reward-driven, multi-agent video understanding with dynamic reflection and frame selection.