Researcher, Microsoft
1 paper at NeurIPS 2025
We find that excessively scaling Chain of Thought (CoT) length can impair a model's reasoning performance in certain domains, and we propose a Thinking-Optimal Scaling strategy to achieve more effective and efficient test-time scaling.