Researcher, Shanghai Artificial Intelligence Laboratory
2 papers at NeurIPS 2025
We study the thinking process in visual reinforcement fine-tuning.