MS student, ETHZ - ETH Zurich
1 paper at NeurIPS 2025
Scaling neural networks leads to compositional generalization if the training distribution sufficiently covers the task space.