PhD student, University of Cambridge
1 paper at NeurIPS 2025
We propose a cognitive-science-inspired framework and benchmark to systematically evaluate the learning abilities of large language models.