Lecturer, University of Oxford
2 papers at NeurIPS 2025
Ultra-realistic benchmark environments and evaluation framework for web agents
Defences against LLM misuse fine-tuning attacks that aim to detect individual malicious or suspicious samples are insufficient.