online learning; Contextual Markov Decision Process; maximum likelihood estimation; delayed reward; reinforcement learning; auto-bidding
1 paper across 1 session
Poster Session 1
1 paper
Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM
Exhibit Hall C,D,E
Learning Personalized Ad Impact via Contextual Reinforcement Learning under Delayed Rewards
#3206 · Yuwei Cheng, Zifeng Zhao, Haifeng Xu