Search Scientist, Baidu Inc.
2 papers at NeurIPS 2025
This paper is to jointly optimize mutiple modules in complex RAG pipeline using multi-agent reinforcement learning.
We propose ExSearch, an agentic search framework, where the LLM learns to retrieve useful information as the reasoning unfolds through a self-incentivized process.