MS student, Southwest University of Finance and Economics
1 paper at NeurIPS 2025
LoTA-QAF fine-tunes quantized LLMs with ternary adapters, so the adaptation can be merged into the quantized weights without loss.
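
To make the idea of a lossless merge concrete, here is a minimal sketch. It is not the paper's algorithm. It assumes an unsigned 4-bit integer grid and a per-tensor scale and zero point, and the names `merge_ternary` and `dequantize` are hypothetical. The point it illustrates is that an update restricted to {-1, 0, +1} moves each integer weight by at most one grid step, so the merged weights are still valid quantized integers and need no re-quantization.

```python
# Illustrative sketch only: the 4-bit grid, the clamping rule, and all
# function names are assumptions, not taken from LoTA-QAF itself.
QMIN, QMAX = 0, 15  # assumed unsigned 4-bit integer range


def merge_ternary(q_weights, ternary_delta):
    """Add a {-1, 0, +1} update to integer weights, staying on the grid."""
    merged = []
    for q, t in zip(q_weights, ternary_delta):
        if t not in (-1, 0, 1):
            raise ValueError("delta must be ternary")
        # Clamp so the result remains a representable 4-bit value
        # (an assumption of this sketch; updates at the edges are dropped).
        merged.append(min(QMAX, max(QMIN, q + t)))
    return merged


def dequantize(q_weights, scale, zero_point):
    """Map integer weights back to real values with an affine scheme."""
    return [(q - zero_point) * scale for q in q_weights]


q = [0, 7, 15, 3]        # quantized integer weights
delta = [-1, 1, 1, 0]    # ternary adapter update
merged = merge_ternary(q, delta)
print(merged)                       # [0, 8, 15, 3]
print(dequantize(merged, 0.5, 8))   # [-4.0, 0.0, 3.5, -2.5]
```

Because the merged weights are plain integers in the same range, inference can use the same low-bit kernels as the original quantized model, with no separate adapter path.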