Undergraduate student, Renmin University of China
1 paper at NeurIPS 2025
We present LLaDA, a diffusion language model trained from scratch whose performance is competitive with LLaMA 3.