PhD student, University of Waterloo
1 paper at NeurIPS 2025
We introduce a novel reasoning paradigm: pixel-space reasoning. We identify a learning trap that arises when cultivating this ability and propose a curiosity-driven RL approach to address it.