2 papers across 2 sessions
We propose a novel value function learning scheme for hierarchical policy in offline GCRL