Researcher, Apple
1 paper at NeurIPS 2025
We show that using checklists to automatically grade responses for reinforcement learning leads to improved instruction following