Undergraduate student, Huazhong University of Science and Technology
1 paper at NeurIPS 2025
This paper introduces CorrectBench, the first comprehensive benchmark for systematically evaluating self-correction mechanisms in large language models (LLMs).