1 paper across 1 session
We introduce TalkCuts, a large-scale dataset for multi-shot speech video generation, and demonstrate its utility through a simple LLM-guided generation baseline.