PhD student, Johns Hopkins University
2 papers at NeurIPS 2025
Exploration of rule-based reinforcement learning (RL) in MLLM post-training for perception policy learning.