Sida Li

PhD student, University of Chicago

1 paper at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 4

1 paper

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

ShorterBetter: Guiding Reasoning Models to Find Optimal Inference Length for Efficient Reasoning

#4009 · Jingyang Yi, Justin Wang, Sida Li

We propose ShorterBetter, a reinforcement learning method that trains reasoning models to generate concise yet accurate Chain-of-Thought traces by rewarding the shortest correct response among sampled outputs.