Researcher, Microsoft Research
2 papers at NeurIPS 2025
We propose a cognitive-science-inspired framework and benchmark to systematically evaluate the learning abilities of large language models.