PhD student, Beijing University of Posts and Telecommunications
1 paper at NeurIPS 2025
This paper investigates how hallucinations arise and persist in RLLM reasoning, revealing error self-reinforcement and limited metacognition.