2 papers across 1 session
In this paper, we systematically evaluate RL and control-based methods on a suite of navigation tasks, using offline datasets of varying quality.
A method to produce policies directly from language instructions without in-domain supervision