1 paper across 1 session
Local SGD converges faster under low second-order heterogeneity, and we prove it with tight bounds and supporting experiments.