PhD student, Swiss Federal Institute of Technology
2 papers at NeurIPS 2025
We provide a method for accurate end-to-end FP4 training of Large Language Models.
A low-precision scheme for fine-tuning LLMs