We introduce and analyze the Attention-Indexed Model (AIM), a theoretical framework for studying learning in deep attention layers.