Principal Researcher, 言创智信
2 papers at NeurIPS 2025
We investigate the phenomenon of few-shot expert identification in large Mixture-of-Experts models and propose EASY-EP, a simple yet effective method for expert pruning.
We cast unlearning as constrained optimization (minimize unlearning subject to bounded utility loss) and propose implicit gradient-surgery that recovers the constrained solution with one backprop, enabling efficient, utility-preserving unlearning.