PhD student, Department of Computer Science, University of Oxford
1 paper at NeurIPS 2025
Training reinforcement learning agents from a single language instruction using vision-language models.