PhD student, Max Planck Institute for Intelligent Systems, Max-Planck Institute
2 papers at NeurIPS 2025
We propose a value gradient matching formulation for reward finetuning/alignment for flow matching models with the theory of optimal control, and empirically verify our method on the popular text-to-image flow matching model StableDiffusion3