PhD student, The University of Tokyo
1 paper at NeurIPS 2025
We rigorously identify the infinite–width limit distribution of neurons within a single attention layer under realistic architectural dimensionality