-
Chemfeat
-
The project provides a Python package and command-line tool for generating feature vectors from molecules. The feature sets to include in the feature vectors are configurable via a simple YAML configuration file. Molecules are specified as lists of Inchis.
-
-
https://gitlab.inria.fr/jrye/chemfeat
-
-
Hydronaut
-
A Python framework for machine- and deep-learning that makes it easy to use Hydra for hyperparameter configuration and MLflow for experiment tracking and result distribution. The user only needs to create a single YAML configuration file and a subclass of Hydronaut.Experiment to use the framework.
Hydra allows the user to systematically sweep all hyperparameter combinations or optimize them use different strategies with plugins for libraries such as Optuna.
MLflow provides a web interface, command-line interface and Python API for exploring and sharing the results.
The framework is fully compatible with PyTorch Lightning and provides a custom subclass to facilitate its use.
-
-
https://gitlab.inria.fr/jrye/hydronaut
-
-
jnp
-
A Python library and command-line tool to automatically export presentations from Jupyter notebooks along with custom CSS and other user-configurable website content.
-
-
https://gitlab.inria.fr/jrye/jnp
-
-
Metabograph
-
Metabograph is a Python library for generating metabolic networks from BioPAX data. The generated NetworkX graphs can be used in various application such as training graph neural networks (GNNs) or saved to files for use with viewers such as Gephi.
-
-
https://gitlab.inria.fr/jrye/metabograph
-
-
MLflow Extra
-
Utility scripts and matching python module for working with MLflow directories. It mainly provides functionality for moving around and regrouping "mlruns" directories which is useful in contexts such as migrating results from a cluster.
-
-
https://gitlab.inria.fr/jrye/mlflow-extra
-
-
MolPred
-
A Hydronaut-based framework that uses ChemFeat to generate feature vectors for training machine- and deep-learning models to predict properties of molecules.
-
-
https://gitlab.inria.fr/jrye/molpred
-
-
PyChronicle
-
Python library containing classes for representing sequences and chronicles, ie a representation of a temporal pattern. It implements efficient recognition algorithms to match chronicles in a long sequence.
-
-
https://gitlab.inria.fr/tguyet/pychronicles
-
-
PyPTU
-
Parse PicoQuant PTU files in Python. It includes a Python package and a command-line utility.
-
-
https://gitlab.inria.fr/jrye/pyptu
-
-
SNDS Data Generator
-
The SNDS, formerly SNIIRAM, is a huge database (consisting of several TBs of data and about 700 tables) that contains information about healthcare reimbursements of about 60 million French insured patients. This database is used to carry out epidemiological and medical-economic studies. Due to its sensitive medical content, identifying information (names, social security numbers) is removed or replaced by spurious information. This repository contains a solution to generate a synthetic version of the database.
The software generates relational data compliant with the original database schema with realistic distributions. This guarantees privacy preservation thanks to the use of open data only.
This work is currently under review to the AIME conference.
-
-
https://gitlab.inria.fr/tguyet/medtrajectory_datagen
-
-
SWoTTeD
-
SWoTTeD is a tensor decomposition framework to extract temporal phenotypes from structured data. Most recent decomposition models allow extracting phenotypes that only describe snapshots of typical profiles, also called daily phenotypes. However, SWoTTeD extends the notion of daily phenotype into temporal phenotype describing an arrangement of features over a time window.
-
-
https://hsebia.gitlabpages.inria.fr/swotted/readme.html
-
-
VCNet
-
VCNet is a counterfactual generator for tabular datasets. Counterfactual generation is an explainable artificial intelligence technique that helps users understand the decisions of predictive models.
VCNet handles both categorical and numerical variables and can also manage immutable variables (i.e., variables that cannot be changed while generating counterfactuals). Currently, it is implemented for binary classification only. VCNet is based on conditional variational encoders and offers an easy and fast training architecture that generates realistic counterfactuals (i.e., counterfactuals that likely exist in the data distribution).
This library provides two versions of VCNet:
* Joint-learning architecture: This version includes both the classification model and the counterfactual generator.
* Post-hoc architecture: This version includes only the counterfactual generator, which can be fitted on any prediction model (Torch module).Once the VCNet model is fitted, it can be used to generate efficiently valid counterfactuals of examples.
-
Counterfactual explanation is a common class of methods to make local explanations of machine learning decisions. For a given instance, these methods aim to find the smallest modification of feature values that changes the predicted decision made by a machine learning model. One of the challenges of counterfactual explanation is the efficient generation of realistic counterfactuals.
To address this challenge, we propose VCNet — Variational Counter Net — a model architecture that combines a predictor and a counterfactual generator that are jointly trained, for regression or classification tasks. Once fitted, VCNet is able to both generate predictions, and to generate counterfactual explanations without having to solve another minimization problem.
Our contribution is the generation of counterfactuals that are close to the distribution of the predicted class. This is done by learning a variational autoencoder conditionally to the output of the predictor in a join-training fashion.
The empirical evaluation on tabular datasets and across several interpretability metrics demonstrate that VCNet is competitive with the state-of-the-art methods.
-
https://gitlab.inria.fr/tguyet/vcnet
-