Associate Professor, Southeast University
1 paper at NeurIPS 2025
We propose a human-in-the-loop learning method that achieves faithful imitation via distribution alignment and adapts to evolving behavior using dynamic regret minimization.