PhD student, National University of Singapore
1 paper at NeurIPS 2025
NoisyRollout boosts VLM reasoning by mixing clean and noisy inputs during RL, improving generalization with no extra cost.