PhD student, EPFL - EPF Lausanne
1 paper at NeurIPS 2025
We take an important step towards pure FP8 training of LLMs by enabling stable FP8 dot-product attention, reaching new throughput records.