PhD student, Tel Aviv University
1 paper at NeurIPS 2025
We present regret bounds for adversarial contextual bandits with general function approximation under delayed bandit feedback.