Researcher, Alibaba Group
1 paper at NeurIPS 2025
We introduce ThinkSound, a framework that uses Chain-of-Thought reasoning to break audio generation for videos down into a step-by-step, interactive process.