2 papers across 1 session
We derive high probability excess risk bounds to at most $\tilde{O}(1/n^2)$ for ERM, GD and SGD and our high probability results on the generalization error of gradients for nonconvex problems are also the sharpest.
We present the first generalization bound for algorithm configuration that closely approximates practical model-based algorithm configurators.