PhD student, national university of singaore, National University of Singapore
2 papers at NeurIPS 2025
We investigate an efficient and effective context-window scheduling method for language model pretraining.
NoisyRollout boosts VLM reasoning by mixing clean and noisy inputs during RL, improving generalization with no extra cost.