PhD student, The Chinese University of Hong Kong
1 paper at NeurIPS 2025
For multi-agent offline safe reinforcement learning (MOSRL), we propose the first algorithm MOSDT, and the first dataset and benchmark MOSDB.