Undergrad student, Tsinghua University, Tsinghua University
1 paper at NeurIPS 2025
This paper introduces a nove metric (REG) for evaluating the reasoning efficiency of LRMs and a reinforcement learning method (REO-RL) that significantly reduces reasoning redundancy while maintaining accuracy.