Full Professor, University of Exeter
3 papers at NeurIPS 2025
We propose a new algorithm that introduces guarantees for minimum user satisfaction rates in language model zoos while optimizing for operating cost, which can be practical for inference endpoint services.
To adapt ML models to concept drift under strict resource constraints, we propose a lightweight drift-plus-penalty policy that provably limits resource usage and achieves robust results.