PhD student, Purdue University
1 paper at NeurIPS 2025
Global Convergence with Order-Optimal rate for Average Reward Constrained MDPs with Primal-Dual Natural Actor Critic Algorithm