ML Systems - NeurIPS 2025

today local_bar

ML Systems

1 paper across 1 session

Poster Session 6

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Multipole Attention for Efficient Long Context Reasoning

#3518 · Coleman Hooper, Sebastian Zhao, Luca Manolache, Sehoon Kim, Michael Mahoney, Sophia Shao, Kurt Keutzer, Amir Gholami

Accelerating attention for long-context reasoning by identifying and loading important tokens and by approximating attention to less important tokens