Researcher, Salesforce AI Research
2 papers at NeurIPS 2025
An agentic pipeline for multi-turn synthetic data generation that produces high-quality training data for AI agents.
We analyze which kinds of LLMs improve substantially under RL finetuning and propose behavior injection augmentation to prepare LLMs for RL.