Thomas L. Griffiths

Professor, Princeton University

3 papers at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 2

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)

#301 · Ruaridh Mon-Williams, Max Taylor-Davies, Elizabeth Mieczkowski, Natalia Vélez, Neil R Bramley, Yanwei Wang, Thomas L. Griffiths, Christopher G. Lucas

Recurrent neural networks spontaneously model partners during collaboration — without specialised architectures — but only when partner-specific adaptation improves task performance.

Poster Session 3

1 paper

Thursday, December 4, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Are Large Language Models Sensitive to the Motives Behind Communication?

#2207 · Addison J. Wu, Ryan Liu, Kerem Oktar, Theodore Sumers, Thomas L. Griffiths

Poster Session 6

1 paper

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers

#1115 · Andrew Joohun Nam, Henry Conklin, Yukang Yang, Thomas L. Griffiths, Jonathan D. Cohen, Sarah-Jane Leslie

The paper proposes Causal Head Gating, a scalable, unsupervised method to classify transformer attention heads by causal impact on task performance that reveal task-specific sub-circuits.