6 papers across 2 sessions
We investigate last-iterate convergence of Regret Matching$^+$ variants in games satisfying the weak Minty variational inequality.
This paper enhances fully decentralized cooperative multi-agent reinforcement learning from a context-modeling perspective.
We present the first parameter-free last-iterate convergence result for Counterfactual Regret Minimization algorithms.