1 paper across 1 session
We propose deep RL method to learn both Nash equilibrium policies and distributions in non-stationary, continuous-space MFGs.