4 papers across 2 sessions
An LLM-based retrosynthesis planning agent trained end-to-end with Agentic Reinforcement Learning.