2 papers across 2 sessions
We find a small set of neurons whose activations can be redirected at test time to mitigate high-norm artifacts in Vision Transformers.
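One way to picture this kind of intervention: a test-time forward hook that edits a handful of MLP hidden activations. The PyTorch sketch below is illustrative only, not the paper's code; the neuron indices, the target layer, and the zeroing rule are all placeholder assumptions.

```python
import torch
from torchvision.models import vit_b_16, ViT_B_16_Weights

# Hypothetical neuron indices; the paper identifies a specific small set,
# these are placeholders for illustration only.
ARTIFACT_NEURONS = [11, 42, 397]

def redirect_hook(module, inputs, output):
    # output: (batch, tokens, hidden) activations of the MLP's first linear.
    # Zeroing is one simple "redirection"; the actual intervention may
    # differ (e.g., rescaling or projecting onto a target direction).
    output[..., ARTIFACT_NEURONS] = 0.0
    return output

model = vit_b_16(weights=ViT_B_16_Weights.DEFAULT).eval()

# Hook the hidden layer of the last block's MLP (an assumed locus; which
# layer matters depends on where the high-norm neurons actually live).
handle = model.encoder.layers[-1].mlp[0].register_forward_hook(redirect_hook)

with torch.no_grad():
    out = model(torch.randn(1, 3, 224, 224))  # forward pass with the edit active

handle.remove()  # detach the hook to restore the unmodified model
```

Because the edit lives in a hook rather than in the weights, it costs nothing at training time and can be toggled per forward pass.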
Attention sinks in LLMs serve as geometric reference frames that anchor token representations in high-dimensional space; they emerge during training as optimal solutions to the coordinate-system problem, shaped by architecture and positional encodings.
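The sink phenomenon itself is easy to observe: later tokens concentrate attention mass on token 0. A minimal probe, assuming a Hugging Face GPT-2 as a stand-in model (the paper's models and exact metric may differ):

```python
import torch
from transformers import GPT2Tokenizer, GPT2LMHeadModel

tok = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

inputs = tok("Attention sinks anchor token representations.", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_attentions=True)

for layer, attn in enumerate(out.attentions):
    # attn: (batch, heads, query, key). Average the attention that every
    # query after the first assigns to key position 0, across heads.
    sink_mass = attn[0, :, 1:, 0].mean().item()
    print(f"layer {layer:2d}: attention to token 0 = {sink_mass:.3f}")
```

A large per-layer value for token 0 relative to a uniform baseline is the usual signature of a sink.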