today local_bar

Shan Ning

PhD student, ShanghaiTech University

1 paper at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 1

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

NoisyGRPO: Incentivizing Multimodal CoT Reasoning via Noise Injection and Bayesian Estimation

#5007 · Longtian Qiu, Shan Ning, Jiaxuan Sun, Xuming He

A systematic multimodal RL framework that improves the policy exploration and advantage estimation.