Associate Professor, The Chinese University of Hong Kong
2 papers at NeurIPS 2025
For multi-agent offline safe reinforcement learning (MOSRL), we propose MOSDT, the first algorithm, and MOSDB, the first dataset and benchmark.