Full Professor, Zhejiang University
1 paper at NeurIPS 2025
We reveal how return-coverage affects the performance of conditional sequence modeling policies in offline RL and propose an algorithm achieving new state-of-the-art results on D4RL.