Postdoc, Purdue University
2 papers at NeurIPS 2025
Global Convergence with Order-Optimal Rate for Average Reward Constrained MDPs with Primal-Dual Natural Actor Critic Algorithm