PhD student, Carnegie Mellon University
1 paper at NeurIPS 2025
We analyze which kinds of LLMs improve substantially under RL fine-tuning and propose behavior injection augmentation to prepare LLMs for RL.