PhD student, University of California, Berkeley
3 papers at NeurIPS 2025
We present JetLM, a new family of LMs, which matches leading full-attention models while significantly improving generation throughput.
a sparse attention with $\mathcal O(n \log n)$ complexity for long video generation
We propose a method to speedup video diffusion generation through efficient attention.