PhD student, The Chinese University of Hong Kong (Shenzhen)
2 papers at NeurIPS 2025
LeVo generates high-quality songs that closely follow instruction by pairing paralled mixed and dual-track token prediction with three-stage training and DPO-based multi-preference alignment.
This paper presents SongBloom, the first unified autoregressive diffusion model for long-form song generation, achieving state-of-the-art performance compared to both commercial and non-commercial methods.