PhD student, Institute of automation, Chinese academy of science, Chinese Academy of Sciences
1 paper at NeurIPS 2025
Transformers can learn self-verifying reflection without language, and reinforcement learning enhances performance through shallow statistical patterns.