5 papers across 3 sessions
We develop a new framework and establish convergence to second-order stationary points under generalized smoothness.
We show that clipped SGD converges with high probability on convex $(L_0,L_1)$-smooth functions.
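For orientation, the clipped SGD update can be sketched as below. This is only a minimal illustration of the generic update $x \leftarrow x - \eta \cdot \mathrm{clip}(g, c)$, not the paper's exact algorithm or parameter choices. The function names (`clip`, `clipped_sgd`, `noisy_quartic_grad`), the step size, the clipping threshold, and the test objective $f(x) = \sum_i x_i^4 / 4$ (convex, with a gradient that is not globally Lipschitz) are all assumptions made for the example.

```python
import math
import random


def clip(grad, c):
    """Rescale grad so that its Euclidean norm is at most c."""
    norm = math.sqrt(sum(g * g for g in grad))
    if norm <= c:
        return list(grad)
    return [g * (c / norm) for g in grad]


def clipped_sgd(stoch_grad, x0, step, c, iters, rng):
    """Iterate x <- x - step * clip(g, c), where g is a stochastic gradient at x."""
    x = list(x0)
    for _ in range(iters):
        g = clip(stoch_grad(x, rng), c)
        x = [xi - step * gi for xi, gi in zip(x, g)]
    return x


def noisy_quartic_grad(x, rng):
    # Hypothetical oracle: gradient of f(x) = sum(x_i^4) / 4 plus Gaussian noise.
    return [xi ** 3 + rng.gauss(0.0, 0.1) for xi in x]


if __name__ == "__main__":
    rng = random.Random(0)
    x_final = clipped_sgd(noisy_quartic_grad, [3.0, -2.0], step=0.1, c=1.0,
                          iters=2000, rng=rng)
    print(x_final)
```

Because the clipped step has norm at most `step * c`, the iterates cannot blow up far from the minimizer, where the raw gradient of this objective grows cubically.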