Games and sequential decision making under uncertainty
- A gym environment for Backgammon
- A gym environment for the Rubik’s cube
- An environment for autonomous driving decision-making
- Crazy Stone, the award-winning Go-playing program
- Brennus, our poker-playing program
Supervised learning
- iso-lambda-descent for supervised classification and regression (see this paper)
- ECON for supervised classification and regression (see this paper)
- One-class nu-SVM for classification (this software was started while Stéphane was a postdoc in SequeL)
Optimization / bandits
- OOR, Optimistic Optimization in R
- KernelUCB, Kernel UCB: a kernelised UCB algorithm for contextual linear bandits
- SOO, Simultaneous Optimistic Optimization
- StoSOO, Stochastic Simultaneous Optimistic Optimization
- Yet another StoSOO (in C)
- POO, Parallel Optimistic Optimization
Markov decision processes
Sketching
- SQUEAK: Distributed adaptive sampling for kernel matrix approximation
Sampling
- DPPy: Sampling Determinantal Point Processes with Python
