Senior Staff Algorithm Engineer, Alibaba Group
2 papers at NeurIPS 2025
we introduce ThinkSound, a framework that utilizes Chain-of-Thought reasoning to systematically break down audio generation for videos into a step-by-step interactive process.