MS student, Seoul National University
1 paper at NeurIPS 2025
We propose the first Thompson Sampling algorithm with Pareto regret guarantees in multi-objective linear contextual bandit.