Undergraduate student, University of Science and Technology of China
1 paper at NeurIPS 2025
We introduce a novel reasoning paradigm: pixel-space reasoning. We identify a learning trap that arises when cultivating this ability and propose a curiosity-driven RL approach to address it.