Music generation

3 papers across 2 sessions

Poster Session 2

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction

We propose unified latent diffusion framework for simultaneous music generation, source imputation, and query-driven arbitrary source extraction.

Poster Session 6

2 papers

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

LeVo: High-Quality Song Generation with Multi-Preference Alignment

#1813 · Shun Lei, Yaoxun XU, ZhiweiLin, Huaicheng Zhang, Wei tan, Hangting Chen, Yixuan Zhang, Chenyu Yang, Haina Zhu, Shuai Wang, Zhiyong Wu, Dong Yu

LeVo generates high-quality songs that closely follow instruction by pairing paralled mixed and dual-track token prediction with three-stage training and DPO-based multi-preference alignment.

SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

#1807 · Chenyu Yang, Shuai Wang, Hangting Chen, Wei Tan, Jianwei Yu, Haizhou Li

This paper presents SongBloom, the first unified autoregressive diffusion model for long-form song generation, achieving state-of-the-art performance compared to both commercial and non-commercial methods.