Valda Seminar: Yoan Russac
8 November 2019, 10:30-11:30 ENS, S16 Weighted Linear Bandits for Non-stationary environments. We consider a stochastic linear bandit model in which the available actions correspond to arbitrary context vectors whose associated rewards follow a non-stationary linear regression model. In this setting, the unknown regression parameter is allowed to vary in…