unsupervised environment design

1 paper across 1 session

Poster Session 6

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

LILO: Learning to Reason at the Frontier of Learnability

#4905 · Thomas Foster, Anya Sims, Johannes Forkel, Jakob Foerster

Prove with theory and empirical results that prioritising training on questions with "medium" level of difficulty is beneficial for training reasoning models with RL