Full Professor, University of California Berkeley
1 paper at NeurIPS 2025
We propose BOOM, a model-based RL that uses a soft value-weighted likelihood-free alignment loss to bootstrap the policy from non-parametric planner with world model, achieving state-of-the-art performance.