Researcher, Ant Group
1 paper at NeurIPS 2025
We present LLaDA, a diffusion language model trained from scratch that is competitive to LLaMA 3 in performance.