Assistant Professor, New Jersey Institute of Technology
1 paper at NeurIPS 2025
Creating safe and reward maximization policies from offline data via min-max optimization formulation and solving it using no-regret algorithms