Associate Professor, Hangzhou Dianzi University
2 papers at NeurIPS 2025
This paper introduces CorrectBench, the first comprehensive benchmark for systematically evaluating self-correction mechanisms in large language models (LLMs).