Associate Professor, Carnegie Mellon University
2 papers at NeurIPS 2025
We analyze what kind of LLMs have large improvement in RL finetuning and propose behavior injection augmentation to prepare the LLMs for RL.