Full Professor, Tel Aviv University
1 paper at NeurIPS 2025
We show that transformers with linear width can solve many graph problems using constant depth, revealing a trade-off where increasing width enables shallower, faster models—though some tasks still demand quadratic width.