1 paper across 1 session
This paper proposes a novel deep reinforcement learning algorithm robust to unobserved confounding bias in observed data over complex and high-dimensional domains.