2 papers across 2 sessions
We provide a statistically rigorous guidelines for training interactive, multi-step LLM agents, exploring optimal compute allocation, generalization, and hyperparameter settings.
We provide the statistical guarantee for Sinkhorn bridge method to estimate Schrödinger bridge