Researcher, Hong Kong University of Science and Technology
1 paper at NeurIPS 2025
We adopt reinforcement learning to train LLMs to generate quality code with rewards derived from program analysis.