Assistant Professor, New Jersey Institute of Technology
2 papers at NeurIPS 2025
We provide O(\epsilon^{-4}) iteration complexity policy optimization algorithm for robust constrained Markov Decision Processing
We propose the first provably efficient and episode-wise safe RL algorithm for linear constrained MDPs.