Researcher, Tsinghua University
1 paper at NeurIPS 2025
An LLM-based retrosynthesis planning agent trained end-to-end with agentic reinforcement learning.