We rigorously identify the infinite-width limiting distribution of neurons within a single attention layer under realistic architectural dimensionality.