PhD student, Beijing University of Aeronautics and Astronautics
2 papers at NeurIPS 2025
We propose RF-Agent, an automated RL reward function design framework via language agent tree search.
We construct a Progress Reward Model with convergence guarantee for Reinforcement Learning via Large Language Models.