PhD student, National University of Singapore
3 papers at NeurIPS 2025
A large reasoning model learns when to think via Decoupled GRPO.
We introduce VeriThinker, a simple yet effective approach for CoT compression.
We propose a delayed KV-Cache for diffusion language models.