Principal Researcher, Contramont Research
1 paper at NeurIPS 2025
Ultra-realistic benchmark environments and evaluation framework for web agents