Events in December 2023–January 2024
- February 7, 2023 @ Bâtiment IMAG (406) -- Seminar Polaris-tt: Learning in finite-horizon MDP with UCB (Romain Cravic)
Most of you probably know Markov Decision Processes (MDPs). They are very useful for handling situations where an agent interacts with an environment that may involve randomness. Concretely, at each time step the MDP has a current state and the agent chooses an action: this state-action pair induces a (random) reward and a (random) state transition. If the probability distributions of rewards and transitions are known, designing optimal behaviors for the agent is, at least theoretically, easy. What about the case where these distributions are unknown at the early stage of the process? How can optimal behaviors be LEARNED efficiently? A popular way to handle this issue is the optimism paradigm, inspired by UCB algorithms designed for stochastic bandit problems. In this talk, I will present the main ideas of two possible approaches, the UCRL algorithm and the optimistic Q-learning algorithm, which use optimism to perform well in finite-horizon MDPs.
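A minimal sketch of the optimism idea in the Q-learning approach, in the spirit of UCB-style optimistic Q-learning for finite-horizon tabular MDPs (Jin et al., 2018); the environment interface `env.reset()` / `env.step(a)` and the sizes `S`, `A`, `H` are illustrative assumptions, not details from the talk:

```python
import numpy as np

def optimistic_q_learning(env, S, A, H, episodes, c=1.0):
    # Hypothetical env interface: env.reset() -> state, env.step(a) -> (state, reward).
    # Q[h, s, a] is initialized to the maximum achievable return H, so
    # unexplored actions look attractive ("optimism in the face of uncertainty").
    Q = np.full((H, S, A), float(H))
    N = np.zeros((H, S, A))  # visit counts, drive learning rates and bonuses
    for _ in range(episodes):
        s = env.reset()
        for h in range(H):
            a = int(np.argmax(Q[h, s]))  # act greedily w.r.t. the optimistic Q
            s_next, r = env.step(a)
            N[h, s, a] += 1
            n = N[h, s, a]
            alpha = (H + 1) / (H + n)  # step size used by Jin et al. (2018)
            # UCB-style exploration bonus, shrinking as (s, a) is visited more often
            bonus = c * np.sqrt(H**3 * np.log(episodes * S * A) / n)
            v_next = np.max(Q[h + 1, s_next]) if h + 1 < H else 0.0
            target = r + v_next + bonus
            Q[h, s, a] = (1 - alpha) * Q[h, s, a] + alpha * min(target, H)
            s = s_next
    return Q
```

The bonus term plays the same role as the confidence radius in bandit UCB: it keeps the estimates optimistic, so under-explored actions are retried until their uncertainty shrinks.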
- February 21, 2023 @ Bâtiment IMAG (406) -- Seminar Polaris-tt: Decomposition of Normal Form Games - Harmonic, Potential, and Non-Strategic Games (Davide Legacci)
In this talk, we will explore the concept of normal form games and their decomposition into non-strategic, harmonic, and potential games. We will begin by introducing the response graph of a game, which is a visual representation of the strategies available to each player and their corresponding utilities. What dictates the strategic interaction among players is the difference between utilities, rather than the utilities themselves. We will introduce an object that captures this behavior, called the deviation flow of the game, and use it to define non-strategic, harmonic, and potential games. Finally, we will discuss the properties of these components.
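A toy illustration (my own, not from the talk): in a 2x2 game the response graph has a single elementary 4-cycle of unilateral deviations, and by the Monderer-Shapley four-cycle condition the game is a potential game exactly when the deviation flow sums to zero around that cycle; a nonzero sum signals a harmonic component. Matching pennies, the standard harmonic example, fails the test:

```python
import numpy as np

def deviation_flow_cycle_sum(u1, u2):
    # u1[i, j], u2[i, j]: payoffs of players 1 and 2 at profile (i, j).
    # Walk the 4-cycle (0,0)->(1,0)->(1,1)->(0,1)->(0,0), adding the
    # deviating player's utility change along each unilateral-deviation edge.
    flow = 0.0
    flow += u1[1, 0] - u1[0, 0]  # player 1 deviates: (0,0) -> (1,0)
    flow += u2[1, 1] - u2[1, 0]  # player 2 deviates: (1,0) -> (1,1)
    flow += u1[0, 1] - u1[1, 1]  # player 1 deviates: (1,1) -> (0,1)
    flow += u2[0, 0] - u2[0, 1]  # player 2 deviates: (0,1) -> (0,0)
    return flow  # zero iff the 2x2 game is a potential game

# Matching pennies: zero-sum, purely cyclic incentives.
u1 = np.array([[1, -1], [-1, 1]])
u2 = -u1
print(deviation_flow_cycle_sum(u1, u2))  # -8.0 -> not a potential game
```

Note that adding a constant to all of one player's payoffs at a fixed opponent strategy (a non-strategic change) leaves every difference in the cycle sum untouched, which is exactly why the decomposition is built on utility differences rather than utilities.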
- March 30, 2023 @ Bâtiment IMAG (amphitheater) -- PhD defense of Kimang Khun: Apprentissage par renforcement dans les systèmes dynamiques structurés (Reinforcement learning in structured dynamical systems). Thesis supervised by Nicolas GAST and Bruno GAUJAL.