Professor, Nanjing University
4 papers at NeurIPS 2025
This paper introduces CoPDT, a method of using one unified and adaptable DT model for multi-task (multi-budget or multi-constraint) offline safe RL.
We introduce a novel object selection mechanism to allow sim-trained policies to rapidly adapt to real-world visual perturbations.