PhD student, Mila - Quebec Artificial Intelligence Institute
1 paper at NeurIPS 2025
We stabilize gradients for training increasingly deep reinforcement learning agents by using a second-order optimizer and residual connections