Principal AI Research Manager, MediaTek Research
1 paper at NeurIPS 2025
We derive no-regret guarantees for Thompson sampling in episodic reinforcement learning with Gaussian process modelling.