Full Professor, Princeton University
1 paper at NeurIPS 2025
The paper proposes Causal Head Gating, a scalable, unsupervised method to classify transformer attention heads by causal impact on task performance that reveal task-specific sub-circuits.