Undergrad student, University of the Chinese Academy of Sciences
2 papers at NeurIPS 2025
Exploration of rule-based reinforcement learning (RL) in MLLM post-training for perception policy learning.