PhD student, University of Michigan - Ann Arbor
1 paper at NeurIPS 2025
We show that limiting a model's confidence during training can improve test-time scaling in mathematical reasoning.