Assistant Professor, Nanjing University
1 paper at NeurIPS 2025
LoTA-QAF uses ternary adaptation for fine-tuning quantized LLMs, enabling the lossless merging of adaptation into quantized weights.