1 paper across 1 session
We rigorously identify the infinite–width limit distribution of neurons within a single attention layer under realistic architectural dimensionality