PhD student, The Hong Kong University of Science and Technology
1 paper at NeurIPS 2025
LoTA-QAF fine-tunes quantized LLMs with ternary adaptation, so the adaptation can be merged losslessly into the quantized weights.
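
As a rough illustration of why a ternary update can be merged without requantization, the sketch below adds values from {-1, 0, +1} directly to integer weight codes. It is not the LoTA-QAF implementation: the function names, the 4-bit unsigned grid, the scale and zero point, and the clamping at the grid boundary are all assumptions made for this example.

```python
# Illustrative sketch only; names, bit width, and clamping are assumptions,
# not taken from the LoTA-QAF paper or code.

def dequantize(codes, scale, zero_point):
    """Map integer weight codes back to real values."""
    return [scale * (c - zero_point) for c in codes]


def merge_ternary(codes, delta, bits=4):
    """Add a ternary update to integer codes, staying on the integer grid.

    Each entry of `delta` must be -1, 0, or +1. The result is clamped to the
    unsigned range [0, 2**bits - 1] (an assumption for this sketch).
    """
    if any(d not in (-1, 0, 1) for d in delta):
        raise ValueError("delta must be ternary (-1, 0, +1)")
    lo, hi = 0, 2**bits - 1
    return [min(hi, max(lo, c + d)) for c, d in zip(codes, delta)]


codes = [0, 7, 15, 3]     # hypothetical 4-bit weight codes
delta = [-1, 1, 1, 0]     # hypothetical ternary adaptation
merged = merge_ternary(codes, delta)

# The merged weights are still integer codes, so no rounding step is needed.
print(merged)
print(dequantize(merged, scale=0.5, zero_point=8))
```

Because the update is itself an integer step on the quantization grid, the merged weights stay in the same low-bit format. A floating-point adapter would instead have to be rounded back onto the grid after merging.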