MS student, Nanjing University
1 paper at NeurIPS 2025
This paper introduces CoPDT, a method that uses a single unified, adaptable Decision Transformer (DT) model for multi-task (multi-budget or multi-constraint) offline safe RL.