2 papers across 2 sessions
Global Convergence with Order-Optimal rate for Average Reward Constrained MDPs with Primal-Dual Natural Actor Critic Algorithm
We develop a practical algorithm for safe sim-to-real transfer