Postdoc, Tsinghua University, Tsinghua University
1 paper at NeurIPS 2025
DIET makes LLMs more token-efficient by using problem difficulty to dynamically guide compression during reinforcement learning, boosting reasoning performance and enabling superior inference scaling under fixed budgets.