SequeL seminars are held in room A00 on Friday mornings, at 11am (any change of schedule is reported in the Comment section). Seminars either present original work of the speaker, or part of the reading group talk series. You can also find all events in the public Google Calendar feed of the seminar series, or on the seminars platform of the University of Lille.
2019/2020
Date | Speaker | Title | Comment |
---|---|---|---|
Sep. 12th | Yasin Abasi Yadkori (VinAI) | Efficient exploration in sequential decision making problems | Thursday, 11am, room A00 |
Sept 17th | Anders Jonsson (UPF, SequeL) | Overview of Anders’ works | Tuesday, 11am, room A00 |
Oct 4th | Mathieu Seurin (SequeL) | Reinforcement Learning from forbidden actions and Instruction following | Friday, 11am, room A00 |
Oct 11th | Rashid Bakirov | Automating the design of adaptive machine learning algorithms following | Friday, 4pm, room A00 |
Oct 18th | Yannis Flet-Berliac (SequeL) | Deep Reinforcement Learning: Improving Policy Gradient Updates | Friday, 11am, room A00 |
Oct 25th | Ana Bušić (Inria DYOGENE) | Zap Stochastic Approximation and Reinforcement Learning | Friday, 11am, room A00 |
Oct 28th | Arthur Mensch (ENS Paris) | Predicting probability distributions from data: a smoothing perspective | Monday, 11am, room A00 |
Nov 8th | Lilian Besson (CentraleSupélec Rennes, SequeL) | Multi-players Bandit Algorithms for Internet of Things Networks | Salle Plénière |
Nov 14th | Arun Verma (IIT Bombay) | Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback (Slides) |
Thursday, 2pm, room A00 |
Nov 22nd | Jill-Jênn Vie (SequeL) | Learning representations of humans that evolve over time | Friday, 11am, Salle Plénière |
Nov 29th | Johannes Kirschner (ETH) | Beyond Optimism: Information-Directed Exploration in Bandits (Slides) | Friday, 11am, room A00 |
Dec 6th | Rémi Besson (Polytechnique) | TBA |
2018/2019
Date | Speaker | Title | Comment |
---|---|---|---|
Sep. 21st | Hazem Wannous (3DSAM, CRIStAL) | Online hand gesture recognition using combined convolutional and recurrent networks | |
Oct. 12th | Florian Strub (SequeL) | Language grounding learning with multimodal learning and interaction | |
Oct. 22nd | Hassan Saber (SequeL) | Structured bandits | |
Oct. 23rd | Junpei Komiyama (University of Tokyo) | A simple way to deal with cherry-picking | |
Oct. 26th | Han Shao (Chinese University of Hong-Kong) | Almost Optimal Algorithms for Linear Stochastic Bandits with Heavy-Tailed Payoffs | |
Nov. 19th | Tanguy Levent (École Polytechnique) | ||
Nov. 22nd, 11am | Bruno Raffin (Datamove, Inria Grenoble) | From Computing at Scale to Learning at Scale | |
Nov. 23rd | Pierre Ménard (Université de Toulouse) | Gradient Ascent for Active Exploration in Bandit Models | |
Nov. 29th | Abbas Mehrabian (McGill University) | Density estimation of mixtures of Gaussians and Ising models | |
Nov. 30th | Éric Piette (University of Maastricht) | A Stochastic Constraint-Based Approach to General Game Playing | |
Nov. 30th | Xiaotian Yu (Chinese University of Hong-Kong) | A Generic Approach for Accelerating Stochastic Zeroth-Order Convex Optimization | |
Jan. 18th, 2019 | Thibault Defourneau (CRIStAL) | Structure theory of Recurrent Neural Networks: a control-theoretic perspective |
2017/2018
Nov 24thRémy Degenne (ENS Paris-Saclay)Using structural information to get low regret: two bandit cases
Dec 15thSteffen Grünewalder (Lancaster University)Random times, filtrations and the bandit problem
Dec 22ndLilian Besson (CentraleSupélec & SequeL)Multi-player bandits revisited
Jan 12thCédric Malherbe (ENS Cachan)Global optimization of Lipschitz and non-smooth functions
Feb 2ndour new PhD candidatesPhD presentation day
March 9thTor Lattimore (DeepMind)Cleaning up the Neighbourhood: A Full Classification for Adversarial Partial Monitoring
April 6thEdouard Oyallon (CentraleSupelec)Invariance and inversibility in CNNs
April 16thJill-Jenn Vie (Riken AIP) (Deep?) Factorization Machines for Optimizing Human LearningMonday, 2pmApril 20thPierre Sennelart (Ecole Normale Supérieure)Reinforcement learning for intensional data management
May 3rdAndrea Locatelli (Université de Magdebourg)Smoothness and adaptivity in zero-th order nonparametric optimizationThursday, 11amMay 4thRégis Sabbadin (MIA Toulouse)Epidémiologie et conservation de la biodiversité : Défis pour la recherche sur les Processus Décisionnels de Markov
May 11thBruno Scherrer (Inria Nancy)Beyond the one-step greedy operator
June 1stAudrey Durand (MC Gill)Machine Learning applications to Healthcare: An overview
Date | Speaker | Title | Comment |
---|---|---|---|
Sept 7th | Pierre Perrault (SequeL & ENS Paris-Saclay) | Expanding searches on trees and MAB | |
Sept 15th | Aurélien Bellet (Inria MAGNET) | Decentralized Collaborative Learning of Personalized Models over Networks | |
Sept 22nd | Sadegh Talebi (KTH) | Stochastic Combinatorial Optimization under Bandit Feedback and Its Application to Online Shortest Path Routing |
|
Sept 29th | Quentin Berthet (University of Cambridge) | Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe | |
Oct 6th | Subhojyoti Mukherjee (IIT Madras) | Thresholding Bandits with Augmented UCB | |
Oct 27th | Tim van Erven (Leiden University) | Welcome to the Zoo: Fast Rates in Statistical and Online Learning |
Past seminars, 2016/2017:
2015/2016:
Date | Speaker | Title | Comment |
---|---|---|---|
Oct 30th | Akram Erraqabi | Pliable rejection sampling and error-regret tradeoff for a MAB problem | |
Nov 5th | Audrey Durand | Bandits for healthcare | Thursday, 2:00 PM |
Nov 6th | Romain Warlop | Hierarchical Exploration for Accelerating Contextual Bandits | reading group |
Nov 12th | Maluuba | Presentation of the company | Thursday, 10:30 AM, room A11 |
Nov 20th | Florian Strub | Deep Learning for Dummies | |
Nov 27th | Christos Dimitrakakis | Differential Privacy, Bayesian Inference and More | |
Jan 15th | MAGNET: Mark Herbster | Predicting a Switching Sequence of Graph Labelings | 2pm, B21 |
Jan 29th | Bilal Piot | Score-based Inverse Reinforcement Learning | |
Feb 12th | The NIPSers | NIPS Debrief (3 papers) | 10h30-12h |
Feb 19th | Richard Combes | Learning to Rank : Regret Lower Bounds and Efficient Algorithms | |
Mar 3rd | Christopher Dance | Gittins Index Theorem and Calculating Gittins Indices | Thursday, 3.30 pm |
Mar 4th | Christopher Dance | When are Kalman-Filter Restless Bandits Indexable? Linking Whittle indices, maps-with-gaps and mechanical words. | |
Mar 18th | Emilie Kaufmann | Optimal Best Arm Identification with Fixed Confidence | |
Apr 1st | Vianney Perchet | Online Learning in repeated auctions | |
Apr 15th | Wouter M. Koolen | MetaGrad: Faster Convergence Without Curvature in Online Convex Optimization | |
Apr 22nd | Tomáš Kocák | Online learning with noisy side observations | 2 p.m. |
May 13th | Rémi Bardenet | Monte Carlo with Determinental Point Processes | |
May 27th | Cricia Zilda Felicio Paixao | Extending model-based recommenders to deal with user cold-start problem |
2015 spring/summer:
Date | Speaker | Title | Comment |
---|---|---|---|
Jan 8th | Thibaut Munzer | Inverse Reinforcement Learning in Relational Domain | |
Jan 16th | Gergely Neu | Online learning in Markov decision processes | 2:00 PM |
Jan 30th | Hadrian Glaude | Subspace Identification for Predictive State Representation by Nuclear Norm Minimization | |
Feb 20th | Romaric Gaudel | Online Matrix Completion Through Nuclear Norm Regularisation | |
Feb 27th | Jean-Bastien Grill | Learning to Optimize via Information-Directed Sampling | reading group |
Mar 6th | Peter Grünwald | Learning the learning rate: how to repair Bayes when the model is wrong | salle plenière |
Mar 20th | Benjamin Guedj | Aggregation of estimators: Theory and methods | |
Mar 27th | Julien Perolat | Conditional Swap Regret and Conditional Correlated Equilibrium | reading group |
Apr 2nd | Pratik Gajane | REX3: An algorithm for the Adversarial Dueling Bandit problem | |
Apr 24th | Gergely Neu | Exploiting easy data in online optimization | |
Jun 19th | Marta Soare | Sequential Transfer of Samples in Linear Bandit | |
Jun 26th | Julien Perolat | Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games | |
Jun 30th | Jacob Abernethy | Minimax Solutions, Random Playouts, and Perturbations |
2014 autumn/winter:
Date | Speaker | Title | Comment |
---|---|---|---|
Oct 3rd | Frédéric Guillou | Ensemble Contextual Bandits for Personalized Recommendation | reading group |
Oct 10th | Tomáš Kocák | Eluder Dimension and the Sample Complexity of Optimistic Exploration | reading group |
Oct 24th | Daniele Calandriello | Sparse Multi-task Reinforcement Learning | |
Oct 31st | Julien Perolat | Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor | reading group |
Nov 4th | Aivar Sootia | Shaping Pulses to Control Multi-Stable Biological Systems | |
Nov 7th | Bilal Piot | Apprentissage par imitation et tranfert de tâche pour une interaction homme-machine narurelle | |
Nov 13th | Pratik Gajane | Studies in multiarmed bandits and PhD topic overview | |
Nov 19th | Michael Bowling | Algorithms for computing game theoretic solutions to extremely large extensive games | |
Nov 21st | Timothé Collet | Active Learning for Classification: An Optimitic Approach | |
Nov 28th | Tomáš Kocák | Efficient learning by implicit exploration in bandit problems with side observations | |
Dec 5th | Jean-Bastien Grill | Hierarchical optimistic optimization for X-armed bandit | |
Dec 18th | Lihong Li | Multi-armed Bandits on the Internet | salle plenière |
Dec 19th | Lihong Li | On Minimax Optimal Offline Policy Evaluation |
2014 spring/summer:
Date | Speaker | Title |
---|---|---|
Jan 10th | Prashanth L. A. | Actor-critic algorithms for risk-sensitive MDPs |
Feb 14th | Sergio Valcarcel | Diffusion strategies for cooperative reinforcement learning |
Feb 28th | Alessandro Lazaric | Regret Bounds for Reinforcement Learning with Policy Advice |
Mar 12th | Matthieu Geist | Around Inverse Reinforcement Learning and Score-based Classification |
Mar 14th | Bilal Piot | Reinforcement Learning with Expert Demonstrations and links with IRL |
Apr 28th | Jennifer Healey | Transportation Futures: Gossiping Cars and Chatty Cities |
May 16th | Tomáš Kocák | Spectral Bandits for Smooth Graph Functions |
May 20th | Pascal Poupart | Online Bayesian Moment Matching for Latent Dirichlet Allocation |
May 23rd | Djalel Benbouzid | Sequential budgeted classification with complex cost-dependency structure |
Jun 13th | Mylene Maida | An overview of some recent results on deformed random matrix models |
Jul 16th | Marta Soare | Best-Arm Identification in Linear Bandits |
Sep 5th | Prashanth L. A. | Stochastic approximation for speeding up LSTD (and LSPI) |