Assistant Professor, Weizmann Institute of Science
3 papers at NeurIPS 2025
We derive simple generalization bounds for Markov training processes at any time during training, and then apply them to training with Langevin dynamics to improve existing bounds.
We prove that, under appropriate conditions, a single-head softmax attention mechanism exhibits benign overfitting.
We prove that, under appropriate conditions, linear attention is an almost optimal metalearner for linear classification.