PhD student, University of Illinois at Urbana-Champaign
3 papers at NeurIPS 2025
RL fine-tuning in LLMs updates a small subnetwork containing 20–30% of parameters leaving rest of the parameters unchanged.