PhD student, Department of Computer Science, ETHZ - ETH Zurich
1 paper at NeurIPS 2025
Scaling neural networks leads to compositional generalization if the training distribution sufficiently covers the task space.