Faculty Fellow, New York University
2 papers at NeurIPS 2025
We propose a toy model that shows how linear truth encodings can arise in language models.
We derive a closed-form solution for a linear erasure projection that preserves covariance with the main-task labels.
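The erasure step can be sketched numerically. The snippet below is a minimal illustration only, not the paper's closed-form solution: it estimates a concept direction from class-mean differences on synthetic data (all names and the data-generating setup are assumptions for the example) and applies the orthogonal projection that removes it, checking that the covariance between the representation and the concept labels collapses.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic representations: one coordinate carries a binary concept z.
n, d = 2000, 16
z = rng.integers(0, 2, size=n)            # concept labels (e.g. true/false)
u = np.zeros(d)
u[0] = 1.0                                # ground-truth concept direction
X = rng.normal(size=(n, d)) + np.outer(z - 0.5, 2.0 * u)

# Estimate the concept direction via the class-mean difference,
# then erase it with the orthogonal projection P = I - w w^T (w unit-norm).
w = X[z == 1].mean(0) - X[z == 0].mean(0)
w /= np.linalg.norm(w)
P = np.eye(d) - np.outer(w, w)
X_erased = X @ P

# Max absolute covariance between any coordinate and z, before and after.
cov_before = np.abs(np.cov(X.T, z)[:-1, -1]).max()
cov_after = np.abs(np.cov(X_erased.T, z)[:-1, -1]).max()
print(cov_before, cov_after)
```

A covariance-preserving erasure (the paper's setting) would instead use an oblique projection chosen so that the component aligned with the main-task labels survives; the orthogonal projection above shows only the removal half of that construction.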