PhD student, Peking University
1 paper at NeurIPS 2025
Masked Diffusion Models can generate low-perplexity text efficiently, but can not handle tasks requiring high accuracy efficiently.