Full Professor, Korea Advanced Institute of Science and Technology
3 papers at NeurIPS 2025
Using teacher value function and PBRS, propose a theoretically grounded method for preference distillation
Token reduction with mitigating rank-collapsing (over-smoothing) through high-frequency token preservation