Full Professor, Carnegie Mellon University
2 papers at NeurIPS 2025
Through theoretical models and empirical testbeds, we characterize the algorithmic tradeoff between privileged expert distillation and RL, and better options for expert distillation.