Postdoc, EPFL - EPF Lausanne
2 papers at NeurIPS 2025
We study the high-dimensional asymptotics of empirical risk minimization (ERM) in over-parametrized two-layer neural networks with quadratic activations trained on synthetic data.
We introduce and analyze the Attention-Indexed Model (AIM), a theoretical framework for analyzing learning in deep attention layers.