PhD student, New Jersey Institute of Technology
1 paper at NeurIPS 2025
We provide O(\epsilon^{-4}) iteration complexity policy optimization algorithm for robust constrained Markov Decision Processing