1 paper across 1 session
We propose an online reinforcement learning technique to fine-tune a family of flow matching policies for robot learning.