Postdoc, Carnegie Mellon University
1 paper at NeurIPS 2025
Encouraging the model in model-based reinforcement learning to converge to flatter minima in the loss landscape will result in better downstream policies