Full Professor, Northwest Polytechnical University Xi'an
2 papers at NeurIPS 2025
DP²O-SR post-trains generative SR models to better match human perceptual preferences, by optimizing over diverse outputs (sampled only via noise) using IQA-based rewards, without requiring human annotations during training.