Full Professor, University of California, Berkeley
3 papers at NeurIPS 2025
We introduce Alternating Gradient Flows, a framework that models feature learning in two-layer networks with small initialization as an alternation between utility maximization and cost minimization, unifying saddle-to-saddle analyses and explaining the emergence of Fourier features. A toy illustration of the saddle-to-saddle phenomenon follows below.
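This is not the paper's code, just a minimal numerical sketch of the phenomenon the framework targets: with small initialization, a two-layer linear network fitting a low-rank target learns one singular mode at a time, so the loss descends in a staircase of plateaus (saddle-to-saddle dynamics). All dimensions, scales, and learning rates here are illustrative choices.

```python
# Hedged sketch: saddle-to-saddle dynamics in a two-layer linear network
# with small initialization. The loss plateaus, drops, and plateaus again
# as each singular mode of the target is learned in turn.
import numpy as np

rng = np.random.default_rng(0)
d, h = 6, 6                        # input/output dim, hidden width
# Target map with well-separated singular values 3, 2, 1.
U, _ = np.linalg.qr(rng.normal(size=(d, d)))
V, _ = np.linalg.qr(rng.normal(size=(d, d)))
S = np.diag([3.0, 2.0, 1.0] + [0.0] * (d - 3))
T = U @ S @ V.T

scale = 1e-3                       # small initialization
W1 = scale * rng.normal(size=(h, d))
W2 = scale * rng.normal(size=(d, h))

lr, steps = 0.01, 1500
for t in range(steps):
    E = W2 @ W1 - T                # residual of the end-to-end map
    g2, g1 = E @ W1.T, W2.T @ E    # gradients of 0.5 * ||W2 W1 - T||_F^2
    W2 -= lr * g2
    W1 -= lr * g1
    if t % 100 == 0:
        print(f"step {t:5d}  loss {0.5 * np.sum(E**2):.6f}")
```

Running this prints a loss curve that sits near 7.0, steps down to about 2.5 once the leading mode escapes its plateau, then to 0.5, then to zero, one plateau per singular value.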
We solve the learning dynamics of (a close approximation of) word2vec in closed form, revealing what semantic features are learned.
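As a hedged stand-in for the paper's closed-form analysis (which I do not reproduce here), the sketch below uses the classic observation that SGNS-style word2vec implicitly factorizes a PMI-type co-occurrence matrix (Levy and Goldberg, 2014); under a quadratic approximation with small initialization, the top eigendirections of such a matrix are learned first, so inspecting them hints at which semantic features emerge. The toy corpus and window size are made up for illustration.

```python
# Hedged sketch: read off candidate "semantic features" as the leading
# eigendirections of a positive PMI co-occurrence matrix on a toy corpus.
import numpy as np

corpus = ("king queen royal crown palace "
          "dog cat pet fur paw "
          "king crown royal dog pet cat").split()
vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}
V, window = len(vocab), 2

# Symmetric co-occurrence counts within a +/- 2-word window.
C = np.zeros((V, V))
for i, w in enumerate(corpus):
    for j in range(max(0, i - window), min(len(corpus), i + window + 1)):
        if i != j:
            C[idx[w], idx[corpus[j]]] += 1.0

# Positive PMI: log p(w, c) / (p(w) p(c)), clipped at zero.
p_wc = C / C.sum()
p_w = p_wc.sum(axis=1, keepdims=True)
with np.errstate(divide="ignore", invalid="ignore"):
    pmi = np.log(p_wc / (p_w @ p_w.T))
ppmi = np.where(np.isfinite(pmi), np.maximum(pmi, 0.0), 0.0)

# Top eigendirections = the features a quadratic model would learn first.
eigvals, eigvecs = np.linalg.eigh(ppmi)
for k in range(1, 3):              # two leading components
    v = eigvecs[:, -k]
    top = np.argsort(-np.abs(v))[:4]
    print(f"component {k}: " +
          ", ".join(f"{vocab[t]}({v[t]:+.2f})" for t in top))
```

On this toy corpus the leading components separate the royalty words from the pet words, which is the flavor of semantic feature the closed-form analysis characterizes.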