Researcher, Tencent
2 papers at NeurIPS 2025
This paper presents the first unified multimodal CoT-based reward model, capable of multi-dimensional, step-by-step long-chain reasoning for both visual understanding and generation reward tasks.