1 paper across 1 session
We propose scaling laws that predict the loss of models when trained on a mixture of source domains.