Research Engineer, FAIR
1 paper at NeurIPS 2025
We introduce NaturalReasoning, a 2.8M-question dataset spanning diverse domains, enabling effective knowledge distillation and unsupervised self-training to enhance LLM reasoning capabilities.