Researcher, Microsoft Research Asia
1 paper at NeurIPS 2025
This paper presents the first unified multimodal CoT-based reward model, capable of multi-dimensional, step-by-step long-chain reasoning for both visual understanding and generation reward tasks.