Assistant Professor, Monash University
3 papers at NeurIPS 2025
We introduce SWIFT, a token-level self-alignment and distillation method that improves LLMs by assigning token weights via a teacher model.
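To make the idea of token-level weighting concrete, here is a minimal sketch of a generic token-weighted negative log-likelihood. It is not SWIFT's actual formulation, which the description above does not specify. The function names and the weighting rule (weight equals the teacher's probability of the target token) are assumptions made purely for illustration.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one token's vocabulary logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def weights_from_teacher(teacher_logits_per_token, target_ids):
    """Hypothetical weighting rule (an assumption, not taken from SWIFT):
    each token's weight is the teacher's probability of the target token."""
    return [
        math.exp(log_softmax(logits)[target])
        for logits, target in zip(teacher_logits_per_token, target_ids)
    ]


def token_weighted_nll(student_logits_per_token, target_ids, token_weights):
    """Weighted average of per-token negative log-likelihoods.

    Tokens with larger weights contribute more to the loss; a weight of
    zero removes a token from the objective entirely.
    """
    total = 0.0
    weight_sum = 0.0
    for logits, target, w in zip(student_logits_per_token, target_ids, token_weights):
        total += -w * log_softmax(logits)[target]
        weight_sum += w
    return total / weight_sum if weight_sum > 0 else 0.0
```

With uniform weights this reduces to the ordinary mean cross-entropy, so any departure from standard fine-tuning comes entirely from how the teacher assigns the weights.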