1 paper across 1 session
Causal representation learning for downstream tasks (formalized as reward), with algorithms and regret bounds.