Assistant Professor, Renmin University of China, Tsinghua University
1 paper at NeurIPS 2025
This paper is to jointly optimize mutiple modules in complex RAG pipeline using multi-agent reinforcement learning.