PhD student, University of Pennsylvania, University of Pennsylvania
1 paper at NeurIPS 2025
By framing grokking as computational glass relaxation, this work explains grokking from the perspective of Boltzmann entropy and proposes a physics-based grokking-resistant optimizer.