Assistant Professor, Tsinghua University
2 papers at NeurIPS 2025
We show that knowledge acquisition under data mixing can exhibit phase transitions with respect to the mixing ratio and model size.