MS student, École Polytechnique de Montréal, Université de Montréal
1 paper at NeurIPS 2025
We provide a statistically rigorous guidelines for training interactive, multi-step LLM agents, exploring optimal compute allocation, generalization, and hyperparameter settings.