MS student, University of Massachusetts at Amherst
2 papers at NeurIPS 2025
CameraBench is a large-scale effort that pushes video-language models to reason about the language of camera motion just like professional cinematographers.
We introduce TalkCuts, a large-scale dataset for multi-shot speech video generation, and demonstrate its utility through a simple LLM-guided generation baseline.