PhD student, Tsinghua University
2 papers at NeurIPS 2025
We propose MoGE, which enhances exploration in off-policy RL by generating critical experiences, significantly improving sample efficiency and raising performance ceilings across various tasks.