Undergrad student, Beijing University of Aeronautics and Astronautics
1 paper at NeurIPS 2025
We propose RF-Agent, an automated RL reward function design framework via language agent tree search.