1 paper across 1 session
This work presents planning and learning algorithms for average-cost MDPs with dynamic risk measures.