Principal Researcher, The Hospital for Sick Children (SickKids)
2 papers at NeurIPS 2025
We introduce a benchmark using simulated biological systems to evaluate LLMs' scientific discovery capabilities.