Full Professor, Renmin University of China
1 paper at NeurIPS 2025
This paper investigates what kind of R1-Zero-like training is suitable for grounding tasks in GUI agents.