2 papers across 1 session
The paper proposes a principled reward design framework for training LLMs on tool use via reinforcement learning, leading to significant gains over SFT and baseline models in generalization and performance.
OS agents are vulnerable to Malicious Image Patches (MIPs) embedded in screenshots, enabling a novel attack that poses significant security risks.