Learning Rate Schedules

2 papers across 2 sessions

Poster Session 3

1 paper

Thursday, December 4, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Asymptotic theory of SGD with a general learning-rate

#3205 · Or Goldreich, Ziyang Wei, SOHAM BONNERJEE, Jiaqi Li, Wei Biao Wu

Poster Session 6

1 paper

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training

#907 · Minhak Song, Beomhan Baek, Kwangjun Ahn, Chulhee Yun

We show that Schedule-Free methods effectively navigate the river structure of the loss landscape, enabling scalable language model training without decay schedules or extra memory.