PhD student, Mila - Quebec AI Institute
1 paper at NeurIPS 2025
We provide statistically rigorous guidelines for training interactive, multi-step LLM agents, exploring optimal compute allocation, generalization, and hyperparameter settings.