Full Professor, Tsinghua University
2 papers at NeurIPS 2025
We propose a proactive defensive framework against malicious production LLM-based speech synthesis to protect our voice information.
LeVo generates high-quality songs that closely follow instruction by pairing paralled mixed and dual-track token prediction with three-stage training and DPO-based multi-preference alignment.