Full Professor, Washington State University, Pullman
1 paper at NeurIPS 2025
Creating safe and reward maximization policies from offline data via min-max optimization formulation and solving it using no-regret algorithms