PhD student, Korea Advanced Institute of Science & Technology
1 paper at NeurIPS 2025
Larger vocabulary lowers language modeling difficulty by facilitating models to learn non-i.i.d patterns in text more easily