Poster Session 3 · Thursday, December 4, 2025 11:00 AM → 2:00 PM
#5514
Balancing Gradient and Hessian Queries in Non-Convex Optimization
Abstract
We develop optimization methods which offer new trade-offs between the number of gradient and Hessian computations needed to compute the critical point of a non-convex function.
We provide a method that for a twice-differentiable with -Lipschitz Hessian, and input initial point with -bounded sub-optimality and sufficiently small outputs an -critical point, i.e., a point such that , using queries to a gradient oracle and queries to a Hessian oracle. As a consequence, we obtain an improved gradient query complexity of in the case of bounded dimension and of in the case where we are allowed only a single Hessian query.
We obtain these results through a more general algorithm which can handle approximate Hessian computations and recovers known prior state-of-the-art bounds of computing an -critical point, under the additional assumption that has an -Lipschitz gradient, with -gradient queries.