PhD student, University of Massachusetts at Amherst
1 paper at NeurIPS 2025
We introduce TalkCuts, a large-scale dataset for multi-shot speech video generation, and demonstrate its utility through a simple LLM-guided generation baseline.