MS student, Tsinghua University
1 paper at NeurIPS 2025
Leverages a pre-trained diffusion model as a powerful and cost-effective step-level reward model, optimizing the diffusion model itself directly in the noisy latent space.