2 papers across 2 sessions
We present a method for accurate, end-to-end FP4 training of large language models.
FALQON accelerates LoRA fine-tuning by up to 3$\times$ by merging the adapters into an FP8-quantized backbone, which removes the redundant quantization overhead incurred by the small adapter matrices. A sketch of the merging idea follows.
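To make the merging idea concrete, here is a minimal PyTorch sketch (assuming PyTorch ≥ 2.1 for `torch.float8_e4m3fn`). The `quantize_fp8` helper and the `MergedLoRALinear` module are hypothetical names introduced for illustration; the sketch shows only the forward-pass cost structure of a merged backbone, not FALQON's actual training procedure.

```python
import torch
import torch.nn as nn


def quantize_fp8(w: torch.Tensor) -> tuple[torch.Tensor, torch.Tensor]:
    """Hypothetical per-tensor FP8 (E4M3) quantization: scale into the
    representable range, cast to float8, and return (quantized, scale)."""
    scale = w.abs().max().clamp(min=1e-12) / 448.0  # 448 = max finite E4M3 value
    q = (w / scale).to(torch.float8_e4m3fn)
    return q, scale


class MergedLoRALinear(nn.Module):
    """Sketch of the merged-adapter idea: instead of an FP8 backbone GEMM
    plus two extra small GEMMs (and their quantization steps) for the LoRA
    path, fold W + B @ A into a single quantized weight."""

    def __init__(self, weight: torch.Tensor, rank: int = 16):
        super().__init__()
        out_f, in_f = weight.shape
        self.register_buffer("base", weight)  # frozen full-precision backbone
        self.A = nn.Parameter(torch.randn(rank, in_f) * 0.01)
        self.B = nn.Parameter(torch.zeros(out_f, rank))
        self.merge()  # build the initial merged FP8 weight

    @torch.no_grad()
    def merge(self):
        """Fold the current adapter into the backbone and quantize once."""
        merged = self.base + self.B @ self.A
        self.q_weight, self.scale = quantize_fp8(merged)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # One dequantized GEMM stands in for a true FP8 kernel; no separate
        # low-rank matmuls (and no per-step quantization of A and B) occur.
        w = self.q_weight.to(x.dtype) * self.scale
        return x @ w.t()


layer = MergedLoRALinear(torch.randn(256, 512))
y = layer(torch.randn(4, 512))
print(y.shape)  # torch.Size([4, 256])
```

Note that this sketch only illustrates why merging leaves a single quantized GEMM per layer; in actual training the adapter gradient path must still be preserved between merges, which is beyond the scope of this example.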