Full Professor, University of Illinois at Urbana-Champaign
2 papers at NeurIPS 2025
The paper proposes a principled reward design framework for training LLMs to use tools via reinforcement learning, yielding significant gains in generalization and performance over SFT and baseline models.
We introduce MIRAGE, a benchmark for multimodal expert consultation in agriculture featuring single-turn and multi-turn tasks.