Full Professor, Nanjing University of Science and Technology
1 paper at NeurIPS 2025
We propose a human-in-the-loop learning method that achieves faithful imitation via distribution alignment and adapts to evolving behavior using dynamic regret minimization.