2 papers across 2 sessions
We propose a very sparse attention mechanism for diffusion models on T2V and T2I tasks through re-use of softmax statistics.