Undergrad student, Tsinghua University, Tsinghua University
2 papers at NeurIPS 2025
SageAttention3: Microscaling FP4 Attention for Plug-and-Play Inference Acceleration and An Exploration of 8-Bit Attention for Training.
Trainable sparse attention for video diffusion model.