Full Professor, University of Edinburgh, University of Edinburgh
2 papers at NeurIPS 2025
Training LLMs to combine reasoning with external knowledge retrieval via RL without any supervised data on reasoning steps.
We propose LongBioBench for controllable evaluation on Long-Context Language Models