1 paper across 1 session
We propose a novel value function learning scheme for hierarchical policy in offline GCRL