Emeritus, Microsoft
1 paper at NeurIPS 2025
Open-Reasoner-Zero, The first open source implementation of large-scale reasoning-oriented RL training focusing on scalability, simplicity and accessibility