Researcher, Stepfun
3 papers at NeurIPS 2025
We introduce GUI Exploration Lab, a flexible simulator for GUI agent navigation. Experiments show a staged SFT + RL approach (especially multi-turn RL) significantly boosts navigation and exploration capabilities.
Exploration of rule-based reinforcement learning (RL) in MLLM post-training for perception policy learning.