Associate Professor, National University of Singapore
2 papers at NeurIPS 2025
We investigate an efficient and effective context-window scheduling method for language model pretraining.