Undergraduate student, Tsinghua University
1 paper at NeurIPS 2025
SageAttention3: Microscaling FP4 Attention for Plug-and-Play Inference Acceleration and An Exploration of 8-Bit Attention for Training.