Instructor, Kuaishou Technology (快手科技)
3 papers at NeurIPS 2025
This paper presents a systematic pipeline for improving video generation with human feedback, including a large-scale preference dataset, a video reward model, and three alignment algorithms for flow matching models.
Preference fading discrete diffusion tailored for recommendation via modeling preference ratios.