1 paper across 1 session
We propose a bandit framework for GenAI-powered decisions where actions (queries) affect the environment only through stochastic model outputs, and introduce algorithms tailored to this setup.