Researcher, xAI
1 paper at NeurIPS 2025
Accelerating attention for long-context reasoning by identifying and loading important tokens and by approximating attention to less important tokens