Long-run Average Reward

1 paper across 1 session

Poster Session 4

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Sample Complexity of Distributionally Robust Average-Reward Reinforcement Learning

#3210 · Zijun Chen, Shengbo Wang, Nian Si

We propose algorithms for distributionally robust average-reward estimation that require no prior knowledge, with both finite-sample theoretical guarantees and numerical validation.