MS student, Tsinghua University
1 paper at NeurIPS 2025
We propose MoGE, which enhances the Off-policy RL exploration by critical experiences generaion, leading to significant improvements in sample efficiency and performance ceilings across various tasks.