Associate Professor, Princeton University
2 papers at NeurIPS 2025
SAGE‑Eval is the first benchmark to test whether frontier LLMs robustly generalize critical safety knowledge to novel situations, and we show that the strongest model we tested passed only 58% of the safety facts evaluated.
Different ways of prompting the same task elicit different task representations in language models.