Software

Games and sequential decision making under uncertainty

A gym environment for Backgammon
A gym environment for the Rubik’s cube
An environment for autonomous driving decision-making

Crazystone, the award-winning Go software player
Brennus, our Poker software player

Supervised learning

iso-lambda-descent for supervised classification and regression (see this paper)
ECON for supervised classification and regression (see this paper)
1 class nu-SVM for classification (this software was begun while Stéphane was post-doc in SequeL)

Optimization / bandits

OOR, Optimistic Optimization in R
KernelUCB, Kernel UCB: kernelised version for contextual linear bandits
SOO, Simultaneous Optimistic Optimization
StoSOO, Stochastic Simultaneous Optimistic Optimization
Yet an other StoSOO (in C)
POO, Parallel Optimistic Optimization

Markov decision processes

BAC Bayesian Actor-Critic Algorithm
OP Optimistic Planning for MDPs

Sketching

SQUEAK: Distributed adaptive sampling for kernel matrix approximation

Sampling

DPPy: Sampling Determinantal Point Processes with Python

Misc.

A library to control Khepera II robots (en français)

Comments are closed.