Permanent researchers (PIs)
- Gaël Varoquaux (team leader), Research Director
- Marine Le Morvan, Research Scientist (chargée de recherche)
- Jill-Jênn Vie, Research Scientist (chargé de recherche)
- Judith Abécassis, Research Scientist (ISFP)
Junior research members
Students
- Matthieu Doutreligne – PhD student. Working on transfer learning and causal inference for public health, in partnership with HAS.
- Léo Grinsztajn – PhD student. Working on neural networks for tabular and relational data.
- Alexandre Perez – PhD student. Working on supervised learning in the presence of missing values and on assessing classification confidence through calibration and grouping loss.
- Félix Lefebvre – PhD student. Working on large-scale graph-embedding methods to represent large relational stores.
- Samuel Girard – Intern. Reinforcement learning in education.
- Julie Alberge – Intern. Modeling trajectories of diabetic patients from AP-HP.
Post-docs
- Riccardo Cappuzzo – Post-doc, working on assembling features across relational databases.
- Jun Kim – Post-doc, working on graph neural networks for relational databases.
- Clémence Réda – Marie Skłodowska-Curie post-doc on the project Robust Explainable Controllable Standard for drug Screening (RECeSS).
- Lihu Chen – Post-doc, working on natural language processing and large language models.
Engineers
- Lilian Boulard – software engineering apprentice. Working on skrub.
- Tomas Rigaux – data scientist. Working on recommendations for the job market.
- Jovan Stojanovic – data scientist and software engineer. Working on skrub and data science on databases with normalization errors.
Scikit-learn team
Soda hosts part of the scikit-learn development team, with funding via the Inria Foundation.
- Arturo Amor-Quiroz – Research software engineer and PhD in physics. Focused on the scikit-learn documentation.
- Jérémie du Boisberranger – Research software engineer and physicist. Core developer of scikit-learn since 2019.
- Franck Charras – Research software engineer. Working on a GPU programming project, within a partnership with Intel®.
- Vincent Maladière – Research software engineer, focusing on data wrangling, survival analysis, and MLOps. Working on scikit-learn, skrub, and hazardous. Collaboration on health data with AP-HP.
- Loïc Estève – Research software engineer and physicist. Core developer of scikit-learn since 2016.
- Olivier Grisel – Research software engineer. Core developer of scikit-learn since 2010.
- François Goupil – Research software engineer. Leads community engagement and manages the operations of the consortium and the relationship with our patrons.
- Guillaume Lemaître – Research software engineer. Core developer of scikit-learn since 2017.
Research Team Assistant
- Marie Énée
Alumni
- Samuel Brasil de Alburquerque – PhD student. Working on diabetes epidemiology from observational health informatics.
- Alexis Cvetkov-Iliev – PhD student. Working on statistical analysis across relational databases with embeddings.
- Bénédicte Colnet – PhD student. Working on causal inference, with a focus on assessing randomized controlled trials’ external validity.
- Julien Jerphanion – Research software engineer. Core developer of scikit-learn since 2021.