Student, Korea Advanced Institute of Science and Technology
1 paper at NeurIPS 2025
We identify a problem in sparse attention inference and propose a simple solution that substantially improves performance while maintaining low latency.