1 paper across 1 session
a sparse attention mechanism with $\mathcal{O}(n \log n)$ complexity for long video generation