Associate Professor, Kyoto University
1 paper at NeurIPS 2025
We propose the first provably efficient and episode-wise safe RL algorithm for linear constrained MDPs.