logo
today local_bar
Poster Session 6 · Friday, December 5, 2025 4:30 PM → 7:30 PM
#914

Beyond Constraint Violation for Online Convex Optimization with Adversarial Constraints

NeurIPS Poster OpenReview

Abstract

We study Online Convex Optimization with adversarial constraints (COCO). At each round a learner selects an action from a convex decision set and then an adversary reveals a convex cost and a convex constraint function. The goal of the learner is to select a sequence of actions to minimize both regret and the cumulative constraint violation (CCV) over a horizon of length . The best-known policy for this problem achieves regret and CCV. In this paper, we improve this by trading off regret to achieve substantially smaller CCV. This trade-off is especially important in safety-critical applications, where satisfying the safety constraints is non-negotiable.
Specifically, for any bounded convex cost and constraint functions, we propose an online policy that achieves regret and CCV, where is the dimension of the decision set and is a tunable parameter. We begin with a special case, called the Constrained Expert problem, where the decision set is a probability simplex and the cost and constraint functions are linear. Leveraging a new adaptive small-loss regret bound, we propose a computationally efficient policy for the Constrained Expert problem, that attains regret and CCV for number of experts.
The original problem is then reduced to the Constrained Expert problem via a covering argument. Finally, with an additional -smoothness assumption, we propose a computationally efficient first-order policy attaining regret and CCV.
Poster