PhD student, Örebro University
1 paper at NeurIPS 2025
We present a scalable bandit architecture for prompt tuning of decision transformers that improves downstream performance.