PhD student, University of California, Berkeley
4 papers at NeurIPS 2025
a sparse attention with $\mathcal O(n \log n)$ complexity for long video generation
We propose a method to speedup video diffusion generation through efficient attention.
We propose a method which exploit KV cache sparsity efficiently and dynamically through Top-P sampling.