Calendar

The week's events

  • [Seminar] Victor Boone

    Category: Seminars [Seminar] Victor Boone


    April 3, 2024

    Who: Victor Boone

    When: Wednesday, April 3, 14:00-15:00

    Where: 447

    What: Learning MDPs with Extended Bellman Operators

    More: Efficiently learning Markov Decision Processes (MDPs) is difficult. When facing an unknown environment, where is the adequate limit between repeating actions that have shown their efficiency in the past (exploitation of your knowledge) and testing alternatives that may actually be better than what you currently believe (exploration of the environment)? To bypass this dilemma, a well-known solution is the "optimism-in-face-of-uncertainty" principle: Think of the score of an action as being the largest that is statistically plausible.

    The exploration-exploitation dilemma then becomes the problem of tuning optimism. In this talk, I will explain how optimism in MDPs can be all rephrased using a single operator, embedding all the uncertainty in your environment within a single MDP. This is a story about "extended Bellman operators" and "extended MDPs", and about how one can achieve minimax optimal regret using this machinery.

    Bâtiment IMAG (442)
    Saint-Martin-d'Hères, 38400
    France

Comments are closed.