PhD student, The Chinese University of Hong Kong
1 paper at NeurIPS 2025
We study the thinking process in visual reinforcement fine-tuning.