Researcher, NVIDIA
1 paper at NeurIPS 2025
We propose joint recall, a novel synthetic task, and hybrid sparse attention with context-dependent sparsity for better sub-quadratic long-context modeling.