Researcher, Thinking Machines Lab
2 papers at NeurIPS 2025
We adopt reinforcement learning to train LLMs to generate quality code with rewards derived from program analysis.