Researcher, Facebook
1 paper at NeurIPS 2025
CoRe: a high-quality, multi-lingual benchmark for evaluating LLMs’ Code Reasoning capabilities with fundamental static analysis tasks.