Assistant Professor, University of Illinois Urbana-Champaign
3 papers at NeurIPS 2025
RL fine-tuning in LLMs updates only a small subnetwork containing 20–30% of the parameters, leaving the rest unchanged.