PhD student, Washington State University
1 paper at NeurIPS 2025
Learning safe, reward-maximizing policies from offline data by formulating the problem as a min-max optimization and solving it with no-regret algorithms