PhD student, The Hong Kong University of Science and Technology
2 papers at NeurIPS 2025
We introduce a novel reasoning paradigm -- pixel-space reasoning. We identified the learning trap when cultivating this ability and proposed a curiosity-driven RL approach to address it.