Radu-Alexandru Dragomir

Assistant Professor, Télécom Paris

1 paper at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 6

1 paper

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

A Theoretical Framework for Grokking: Interpolation followed by Riemannian Norm Minimisation

#3911 · Etienne Boursier, Scott Pesme, Radu-Alexandru Dragomir

We describe the training of overparametrized architectures with small weight decay as a two-phase dynamics. In particular during the second phase, it follows a Riemannian flow of the norm on the interpolation manifold.