Seminar

SequeL seminars are held in room A00 on Friday mornings, at 11am (any change of schedule is reported in the Comment section). Seminars either present original work of the speaker, or part of the reading group talk series. You can also find all events in the public Google Calendar feed of the seminar series, or on the seminars platform of the University of Lille.

2019/2020

Date Speaker Title Comment
Sep. 12th Yasin Abasi Yadkori (VinAI) Efficient exploration in sequential decision making problems Thursday, 11am, room A00
Sept 17th Anders Jonsson (UPF, SequeL) Overview of Anders’ works Tuesday, 11am, room A00
Oct 4th Mathieu Seurin (SequeL) Reinforcement Learning from forbidden actions and Instruction following Friday, 11am, room A00
Oct 11th Rashid Bakirov Automating the design of adaptive machine learning algorithms following Friday, 4pm, room A00
Oct 18th Yannis Flet-Berliac (SequeL) Deep Reinforcement Learning: Improving Policy Gradient Updates Friday, 11am, room A00
Oct 25th Ana Bušić (Inria DYOGENE) Zap Stochastic Approximation and Reinforcement Learning Friday, 11am, room A00
Oct 28th Arthur Mensch (ENS Paris) Predicting probability distributions from data: a smoothing perspective Monday, 11am, room A00
Nov 8th Lilian Besson (CentraleSupélec Rennes, SequeL) Multi-players Bandit Algorithms for Internet of Things Networks Salle Plénière
Nov 14th Arun Verma (IIT Bombay) Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback
(Slides)
Thursday, 2pm, room A00
Nov 22nd Jill-Jênn Vie (SequeL) Learning representations of humans that evolve over time Friday, 11am,
Salle Plénière
Nov 29th Johannes Kirschner (ETH) Beyond Optimism: Information-Directed Exploration in Bandits (Slides) Friday, 11am,
room A00
Dec 6th Rémi Besson (Polytechnique) TBA

2018/2019

Date Speaker Title Comment
Sep. 21st Hazem Wannous (3DSAM, CRIStAL) Online hand gesture recognition using combined convolutional and recurrent networks
Oct. 12th Florian Strub (SequeL) Language grounding learning with multimodal learning and interaction
Oct. 22nd Hassan Saber (SequeL) Structured bandits
Oct. 23rd Junpei Komiyama (University of Tokyo) A simple way to deal with cherry-picking
Oct. 26th Han Shao (Chinese University of Hong-Kong) Almost Optimal Algorithms for Linear Stochastic Bandits with Heavy-Tailed Payoffs
Nov. 19th Tanguy Levent (École Polytechnique)
Nov. 22nd, 11am Bruno Raffin (Datamove, Inria Grenoble) From Computing at Scale to Learning at Scale
Nov. 23rd Pierre Ménard (Université de Toulouse) Gradient Ascent for Active Exploration in Bandit Models
Nov. 29th Abbas Mehrabian (McGill University) Density estimation of mixtures of Gaussians and Ising models
Nov. 30th Éric Piette (University of Maastricht) A Stochastic Constraint-Based Approach to General Game Playing
Nov. 30th Xiaotian Yu (Chinese University of Hong-Kong) A Generic Approach for Accelerating Stochastic Zeroth-Order Convex Optimization
Jan. 18th, 2019 Thibault Defourneau (CRIStAL) Structure theory of Recurrent Neural Networks: a control-theoretic perspective

2017/2018

Nov 24thRémy Degenne (ENS Paris-Saclay)Using structural information to get low regret: two bandit cases
Dec 15thSteffen Grünewalder (Lancaster University)Random times, filtrations and the bandit problem
Dec 22ndLilian Besson (CentraleSupélec & SequeL)Multi-player bandits revisited
Jan 12thCédric Malherbe (ENS Cachan)Global optimization of Lipschitz and non-smooth functions
Feb 2ndour new PhD candidatesPhD presentation day
March 9thTor Lattimore (DeepMind)Cleaning up the Neighbourhood: A Full Classification for Adversarial Partial Monitoring
April 6thEdouard Oyallon (CentraleSupelec)Invariance and inversibility in CNNs
April 16thJill-Jenn Vie (Riken AIP) (Deep?) Factorization Machines for Optimizing Human LearningMonday, 2pmApril 20thPierre Sennelart (Ecole Normale Supérieure)Reinforcement learning for intensional data management
May 3rdAndrea Locatelli (Université de Magdebourg)Smoothness and adaptivity in zero-th order nonparametric optimizationThursday, 11amMay 4thRégis Sabbadin (MIA Toulouse)Epidémiologie et conservation de la biodiversité : Défis pour la recherche sur les Processus Décisionnels de Markov
May 11thBruno Scherrer (Inria Nancy)Beyond the one-step greedy operator
June 1stAudrey Durand (MC Gill)Machine Learning applications to Healthcare: An overview

Date Speaker Title Comment
Sept 7th Pierre Perrault (SequeL & ENS Paris-Saclay) Expanding searches on trees and MAB
Sept 15th Aurélien Bellet (Inria MAGNET) Decentralized Collaborative Learning of Personalized Models over Networks
Sept 22nd Sadegh Talebi (KTH) Stochastic Combinatorial Optimization under Bandit Feedback and Its Application to Online Shortest Path Routing
Sept 29th Quentin Berthet (University of Cambridge) Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe
Oct 6th Subhojyoti Mukherjee (IIT Madras) Thresholding Bandits with Augmented UCB
Oct 27th Tim van Erven (Leiden University) Welcome to the Zoo: Fast Rates in Statistical and Online Learning

Past seminars, 2016/2017:

Date Speaker Title Comment
Sept 20th Bruno Scherrer (Inria Nancy) On the complexity of Policy Iteration (for Deterministic Problems) Tuesday, 2PM
Sept 30th Jean Lafond (Telecom ParisTech) On the Online Frank-Wolfe algorithms for General Low Rank Matrix Completion
Oct 14th Emile Contal (ENS Cachan) The geometry of Gaussian processes and Bayesian Optimization
Nov 4th Ronan Fruit (SequeL) Exploration-Exploitation in MDPs with Options
Nov 18th Claire Vernade (Telecom ParisTech) Insights on Multiple-Plays Bandits with Censored Feedbacks room A11
Nov 25th Ludovic Denoyer (LIP 6, UPMC) Learning Sequential Predictive Models under Budget Constraints
Nov 28th Andras Gyorgy (Imperical College) Following the Leader and Fast Rates in Linear Prediction Monday, 11.15am, room A00
Dec 16th Joon Kwon (Ecole Polytechnique) Mirror descent strategies for regret minimization and approachability
Jan 20th Daniele Calandriello (SequeL) Pack only the essentials: distributed sequential sampling for adaptive kernel dictionary learning
Feb 3rd Jaouad Mourtada Streaming aggregation of a growing number of experts
Feb 10th Pierre Gaillard (Inria Paris) Sparse Accelerated Exponential Weights
March 3rd James Ridgway (SequeL) Sampling normalizing constants in high dimension using inhomogeneous diffusions
March 10th Anna Harutyunyan (VUB) Potential-based reward shaping as a tool to safely incorporate auxiliary information
March 17th Aditya Gopalan (Indian Institute of Science) Collaborative bandits on a network
March 24th Benjamin Guedj (MODAL) A quasi-Bayesian perspective to NMF: theory and applications
March 31st Tristan Cazenave (LAMSADE) Improvements to Monte-Carlo Tree Search
May 5th Ralph Bourdoukan (SequeL) The role of balance between excitation and inhibition in learning in spiking networks
May 16th Emmanuel Daucé (LIF) Learning in actionable universes 2pm, room A00
June 9th Olivier Catoni (CNRS) Markov substitute processes : a new model for linguistics and beyond
June 16th Xuedong Shang (SequeL) Hierarchical Bandits for black-box optimization
June 23rd Guillaume Gautier (SIGMA & SequeL) Zonotope hit-and-run for efficient sampling from projection DPPs

2015/2016:

Date Speaker Title Comment
Oct 30th Akram Erraqabi Pliable rejection sampling and error-regret tradeoff for a MAB problem
Nov 5th Audrey Durand Bandits for healthcare Thursday, 2:00 PM
Nov 6th Romain Warlop Hierarchical Exploration for Accelerating Contextual Bandits reading group
Nov 12th Maluuba Presentation of the company Thursday, 10:30 AM, room A11
Nov 20th Florian Strub Deep Learning for Dummies
Nov 27th Christos Dimitrakakis Differential Privacy, Bayesian Inference and More
Jan 15th MAGNET: Mark Herbster Predicting a Switching Sequence of Graph Labelings 2pm, B21
Jan 29th Bilal Piot Score-based Inverse Reinforcement Learning
Feb 12th The NIPSers NIPS Debrief (3 papers) 10h30-12h
Feb 19th Richard Combes Learning to Rank : Regret Lower Bounds and Efficient Algorithms
Mar 3rd Christopher Dance Gittins Index Theorem and Calculating Gittins Indices Thursday, 3.30 pm
Mar 4th Christopher Dance When are Kalman-Filter Restless Bandits Indexable? Linking Whittle indices, maps-with-gaps and mechanical words.
Mar 18th Emilie Kaufmann Optimal Best Arm Identification with Fixed Confidence
Apr 1st Vianney Perchet Online Learning in repeated auctions
Apr 15th Wouter M. Koolen MetaGrad: Faster Convergence Without Curvature in Online Convex Optimization
Apr 22nd Tomáš Kocák Online learning with noisy side observations 2 p.m.
May 13th Rémi Bardenet Monte Carlo with Determinental Point Processes
May 27th Cricia Zilda Felicio Paixao Extending model-based recommenders to deal with user cold-start problem

2015 spring/summer:

Date Speaker Title Comment
Jan 8th Thibaut Munzer Inverse Reinforcement Learning in Relational Domain
Jan 16th Gergely Neu Online learning in Markov decision processes 2:00 PM
Jan 30th Hadrian Glaude Subspace Identification for Predictive State Representation by Nuclear Norm Minimization
Feb 20th Romaric Gaudel Online Matrix Completion Through Nuclear Norm Regularisation
Feb 27th Jean-Bastien Grill Learning to Optimize via Information-Directed Sampling reading group
Mar 6th Peter Grünwald Learning the learning rate: how to repair Bayes when the model is wrong salle plenière
Mar 20th Benjamin Guedj Aggregation of estimators: Theory and methods
Mar 27th Julien Perolat Conditional Swap Regret and Conditional Correlated Equilibrium reading group
Apr 2nd Pratik Gajane REX3: An algorithm for the Adversarial Dueling Bandit problem
Apr 24th Gergely Neu Exploiting easy data in online optimization
Jun 19th Marta Soare Sequential Transfer of Samples in Linear Bandit
Jun 26th Julien Perolat Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games
Jun 30th Jacob Abernethy Minimax Solutions, Random Playouts, and Perturbations

2014 autumn/winter:

Date Speaker Title Comment
Oct 3rd Frédéric Guillou Ensemble Contextual Bandits for Personalized Recommendation reading group
Oct 10th Tomáš Kocák Eluder Dimension and the Sample Complexity of Optimistic Exploration reading group
Oct 24th Daniele Calandriello Sparse Multi-task Reinforcement Learning
Oct 31st Julien Perolat Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor reading group
Nov 4th Aivar Sootia Shaping Pulses to Control Multi-Stable Biological Systems
Nov 7th Bilal Piot Apprentissage par imitation et tranfert de tâche pour une interaction homme-machine narurelle
Nov 13th Pratik Gajane Studies in multiarmed bandits and PhD topic overview
Nov 19th Michael Bowling Algorithms for computing game theoretic solutions to extremely large extensive games
Nov 21st Timothé Collet Active Learning for Classification: An Optimitic Approach
Nov 28th Tomáš Kocák Efficient learning by implicit exploration in bandit problems with side observations
Dec 5th Jean-Bastien Grill Hierarchical optimistic optimization for X-armed bandit
Dec 18th Lihong Li Multi-armed Bandits on the Internet salle plenière
Dec 19th Lihong Li On Minimax Optimal Offline Policy Evaluation

2014 spring/summer:

Date Speaker Title
Jan 10th Prashanth L. A. Actor-critic algorithms for risk-sensitive MDPs
Feb 14th Sergio Valcarcel Diffusion strategies for cooperative reinforcement learning
Feb 28th Alessandro Lazaric Regret Bounds for Reinforcement Learning with Policy Advice
Mar 12th Matthieu Geist Around Inverse Reinforcement Learning and Score-based Classification
Mar 14th Bilal Piot Reinforcement Learning with Expert Demonstrations and links with IRL
Apr 28th Jennifer Healey Transportation Futures: Gossiping Cars and Chatty Cities
May 16th Tomáš Kocák Spectral Bandits for Smooth Graph Functions
May 20th Pascal Poupart Online Bayesian Moment Matching for Latent Dirichlet Allocation
May 23rd Djalel Benbouzid Sequential budgeted classification with complex cost-dependency structure
Jun 13th Mylene Maida An overview of some recent results on deformed random matrix models
Jul 16th Marta Soare Best-Arm Identification in Linear Bandits
Sep 5th Prashanth L. A. Stochastic approximation for speeding up LSTD (and LSPI)

Seminars from even longer ago

Comments are closed.