Assistant Professor, Carnegie Mellon University
1 paper at NeurIPS 2025
We train LLMs to reason efficiently using RL