PhD student, Massachusetts Institute of Technology
1 paper at NeurIPS 2025
RL to train LLMs how to generate data and update themselves to adapt to new knowledge/tasks.