today local_bar

Sehoon Kim

Researcher, xAI

1 paper at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 6

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Multipole Attention for Efficient Long Context Reasoning

#3518 · Coleman Richard Charles Hooper, Sebastian Zhao, Luca Manolache, Sehoon Kim, Michael W. Mahoney, Sophia Shao, Kurt Keutzer, Amir Gholami

Accelerating attention for long-context reasoning by identifying and loading important tokens and by approximating attention to less important tokens