Assistant Professor, Carnegie Mellon University
2 papers at NeurIPS 2025
We develop and analyze a scalable algorithm for multi-agent RL by sampling from the mean-field distribution of the agents to overcome the curse of dimensionality.
We design a novel algorithm to stabilize an unknown partially observable LTI system with a complexity bound independent of the state dimension.