-
News
- Journée au vert POLARIS 2022/05/23
- DATAMOVE/POLARIS picnic 2021/06/22
- DATAMOVE/POLARIS BBQ 2019 2019/06/14
- POLARIS Bootcamp (May 2019) 2019/05/24
- slides of Andras Gyorgy 2016/01/15
Next seminars
- 2:00 pm – 3:00 pm, February 7, 2023 – Seminar Polaris-tt Learning in finite-horizon MDP with UCB (Romain Cravic)
Events
Events in February–March 2023
MMonday TTuesday WWednesday TThursday FFriday SSaturday SSunday 30January 30, 2023 31January 31, 2023 1February 1, 2023 2February 2, 2023 3February 3, 2023 4February 4, 2023 5February 5, 2023 6February 6, 2023 7February 7, 2023●(1 event) Seminar Polaris-tt Learning in finite-horizon MDP with UCB (Romain Cravic)
–
February 7, 2023Most of you probably know Markov Decisions Processes (MDP). They are very useful to handle situations where an agent interacts with an environnement that may involve randomness. Concretely, at each time the MDP has a current state and the agent chooses an action : This couple state-action induces a (random) reward and a (random) state transition. If the probability distributions for rewards and transitions are known, at least theoretically, designing optimal behaviors for the agent is easy. What about the case where these distributions are unknown at the early stage of the process ? How to LEARN optimal behaviors efficiently ? A popular way to handle this issue is to use the optimism paradigm, inspired from UCB algorithms designed for stochastic bandits problems. In this talk, I will expose the main ideas of two possible approaches, UCRL algorithm and optimistic Q-learning algorithm, that use optimism to well perform in finite-horizon
Bâtiment IMAG (406)Saint-Martin-d'Hères, 38400France8February 8, 2023 9February 9, 2023 10February 10, 2023 11February 11, 2023 12February 12, 2023 13February 13, 2023 14February 14, 2023 15February 15, 2023 16February 16, 2023 17February 17, 2023 18February 18, 2023 19February 19, 2023 20February 20, 2023 21February 21, 2023 22February 22, 2023 23February 23, 2023 24February 24, 2023 25February 25, 2023 26February 26, 2023 27February 27, 2023 28February 28, 2023 1March 1, 2023 2March 2, 2023 3March 3, 2023 4March 4, 2023 5March 5, 2023 Meta