1 paper across 1 session
A complete reimplementation of MiniGrid environments in JAX, unlocking 160,000x faster experimentation