Researcher, Apple
1 paper at NeurIPS 2025
We propose scaling laws that predict a model's loss when it is trained on a mixture of source domains.