Assistant Professor, Singapore Management University
2 papers at NeurIPS 2025
We propose GRIFFIN, which accelerates LLM inference by addressing the token misalignment issue in speculative decoding.
We propose SoPo, a semi-online preference optimization method that combines the strengths of online and offline direct preference optimization to overcome their individual shortcomings.