PhD student, University of Oxford
1 paper at NeurIPS 2025
We propose a principled taxonomy, evaluation procedure, and unified algorithm space for offline RL.