1 paper across 1 session
We show that clipped SGD converges with high probability on convex $(L_0,L_1)$-smooth functions.