Associate Professor, Tsinghua University
2 papers at NeurIPS 2025
SageAttention3: Microscaling FP4 Attention for Plug-and-Play Inference Acceleration and An Exploration of 8-Bit Attention for Training.
We propose a method to speed up video diffusion generation through efficient attention.