Health and social sciences

Electronic health records

Electronic health records are typically data collected by the care system. As a result, they cover a large number of individuals over a long period of time. However, they are observational data: interventions, such as drug prescriptions, have not been assigned according to a systematic experimental plan. As such, they raise many causal inference challenges, which we study in connection with machine learning. In addition, they are complex relational data, structured for clinical practice or accounting rather than for research. Analyzing them raises challenges of missing values, as well as of statistical modeling across multiple sources of relational data with varying representations. The databases are also full of non-normalized snippets of text, for which we develop dedicated NLP approaches.

We apply our statistical learning research to large electronic health records, in partnership with AP-HP and HAS, and on the database of the French healthcare system, the SNDS. The related questions concern personalized medicine (individualized prediction models) as well as treatment efficacy, for instance to answer public-health questions.

Epidemiological reasoning

From a health-research perspective, the questions are typical of epidemiology and draw on the corresponding line of thought: all kinds of biases must be accounted for, such as a sampling bias between the study population and the target population. Where we innovate is that we strive to replace the traditional parametric models with non-parametric estimates built from machine-learning models.
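
As a minimal sketch of this kind of reasoning (not the team's actual pipeline), the example below corrects a treatment comparison for a sampling bias between a study sample and a target population by standardization: outcome means are estimated separately in each stratum, then averaged with the stratum shares of the target population. The per-stratum mean is the simplest non-parametric estimator; in practice a machine-learning model of the outcome would take its place. All function names, the toy records, and the target shares are hypothetical, and the sketch assumes no unmeasured confounding within a stratum.

```python
from collections import Counter, defaultdict


def cell_means(records):
    """Mean outcome for each (stratum, treated) cell of the study sample."""
    sums = defaultdict(float)
    counts = Counter()
    for stratum, treated, outcome in records:
        sums[(stratum, treated)] += outcome
        counts[(stratum, treated)] += 1
    return {cell: sums[cell] / counts[cell] for cell in counts}


def standardized_effect(records, shares):
    """Treated-minus-untreated difference per stratum, averaged with `shares`."""
    means = cell_means(records)
    return sum(
        share * (means[(stratum, 1)] - means[(stratum, 0)])
        for stratum, share in shares.items()
    )


def sample_shares(records):
    """Share of each stratum in the study sample itself."""
    counts = Counter(stratum for stratum, _, _ in records)
    total = sum(counts.values())
    return {stratum: n / total for stratum, n in counts.items()}


# Hypothetical toy records: (age group, treated: 1/0, recovered: 1/0).
study = [
    ("young", 1, 1), ("young", 1, 1), ("young", 0, 1), ("young", 0, 0),
    ("old", 1, 1), ("old", 1, 0), ("old", 1, 0), ("old", 1, 0),
    ("old", 0, 0), ("old", 0, 0),
]
# Assumed age distribution of the target population (not of the sample).
target = {"young": 0.2, "old": 0.8}

in_sample = standardized_effect(study, sample_shares(study))
in_target = standardized_effect(study, target)
print(f"effect in study sample: {in_sample:.2f}")
print(f"effect in target population: {in_target:.2f}")
```

In this toy data the estimated effect is smaller in the target population than in the sample, because older patients, who benefit less here, are under-represented in the study sample.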

AP-HP’s “entrepôt de données de santé” (health data warehouse)

We collaborate closely with AP-HP, the Paris hospitals, which unites 39 hospitals in a consistent database covering 10 million patients a year. This collaboration was born at the beginning of the COVID crisis, when we used this database (known as the EDS, “entrepôt de données de santé”) to follow the unfolding of a new disease, building prognosis models on the fly and evaluating treatment efficacy.

Educational data mining, learning analytics

Learning platforms collect a large amount of data that can be used to personalize learning. Educational data mining develops mathematical models of student learning (knowledge tracing, spaced repetition systems), for example for students attempting programming exercises. Learning analytics focuses on presenting information to support teacher decision-making (e.g. dashboards in MOOCs).
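
To illustrate what a knowledge-tracing model computes, here is a minimal sketch of Bayesian Knowledge Tracing, one classic model of this family. It is chosen for brevity and is not claimed to be the model used by the team. The function names and the parameter values (initial mastery, learning, slip, and guess probabilities) are arbitrary assumptions for illustration.

```python
def bkt_update(p_known, correct, p_learn=0.2, p_slip=0.1, p_guess=0.2):
    """One Bayesian Knowledge Tracing step for a single skill.

    First apply Bayes' rule to the observed answer, then account for
    the chance that the student learned the skill during this attempt.
    """
    if correct:
        # Correct answer: either the skill is known and no slip, or a lucky guess.
        evidence_known = p_known * (1 - p_slip)
        evidence_unknown = (1 - p_known) * p_guess
    else:
        # Wrong answer: either a slip despite knowing, or a failed guess.
        evidence_known = p_known * p_slip
        evidence_unknown = (1 - p_known) * (1 - p_guess)
    posterior = evidence_known / (evidence_known + evidence_unknown)
    return posterior + (1 - posterior) * p_learn


def trace(answers, p_init=0.3, **params):
    """Estimated mastery probability after each answer in the sequence."""
    p_known = p_init
    history = []
    for correct in answers:
        p_known = bkt_update(p_known, correct, **params)
        history.append(p_known)
    return history


# Hypothetical sequence of attempts on exercises exercising the same skill.
history = trace([True, False, True, True])
for step, p in enumerate(history, start=1):
    print(f"after attempt {step}: P(skill known) = {p:.3f}")
```

The estimated mastery rises after a correct answer and drops after an incorrect one; a tutoring system could use it to decide which exercise to propose next.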

Related publications

Medical applications

Title
Prediction, Not Association, Paves the Road to Precision Medicine
Authors
Danilo Bzdok, Gael Varoquaux, Ewout Steyerberg
Article
JAMA Psychiatry, 2021, 78 (2), pp.127. ⟨10.1001/jamapsychiatry.2020.2549⟩

Education-related

Title
Interpretable Knowledge Tracing: Simple and Efficient Student Modeling with Causal Relations
Authors
Sein Minn, Jill-Jênn Vie, Koh Takeuchi, Hisashi Kashima, Feida Zhu
Article
Proceedings of the AAAI Conference on Artificial Intelligence, Feb 2022, Vancouver, Canada. pp.12810-12818, ⟨10.1609/aaai.v36i11.21560⟩
Full text
https://inria.hal.science/hal-03895625/file/IKT_EAAI.pdf
Title
DAS3H: Modeling Student Learning and Forgetting for Optimally Scheduling Distributed Practice of Skills
Authors
Benoît Choffin, Fabrice Popineau, Yolaine Bourda, Jill-Jênn Vie
Article
JDSE 2019 – Paris-Saclay Junior Conference on Data Science and Engineering, Sep 2019, Gif-sur-Yvette, France
Full text
https://hal.science/hal-03427048/file/DAS3H_JDSE_2019.pdf

COVID-related

Title
International comparisons of laboratory values from the 4CE collaborative to predict COVID-19 mortality
Authors
Griffin Weber, Chuan Hong, Zongqi Xia, Nathan Palmer, Paul Avillach, Sehi L’yi, Mark Keller, Shawn Murphy, Alba Gutiérrez-Sacristán, Clara-Lea Bonzel, Arnaud Serret-Larmande, Antoine Neuraz, Gilbert Omenn, Shyam Visweswaran, Jeffrey Klann, Andrew South, Ne Hooi Will Loh, Mario Cannataro, Brett Beaulieu-Jones, Riccardo Bellazzi, Giuseppe Agapito, Mario Alessiani, Bruce Aronow, Douglas Bell, Vincent Benoit, Florence Bourgeois, Luca Chiovato, Kelly Cho, Arianna Dagliati, Scott Duvall, Noelia García Barrio, David Hanauer, Yuk-Lam Ho, John Holmes, Richard Issitt, Molei Liu, Yuan Luo, Kristine Lynch, Sarah Maidlow, Alberto Malovini, Kenneth Mandl, Chengsheng Mao, Michael Matheny, Jason Moore, Jeffrey Morris, Michele Morris, Danielle Mowery, Kee Yuan Ngiam, Lav Patel, Miguel Pedrera Jiménez, Rachel Ramoni, Emily Schriver, Petra Schubert, Pablo Serrano Balazote, Anastasia Spiridou, Amelia Tan, Byorn Tan, Valentina Tibollo, Carlo Torti, Enrico Trecarichi, Xuan Wang, James Aaron, Adem Albayrak, Giuseppe Albi, James Balshi, Anna Alloni, Danilo Amendola, François Angoulvant, Brett Beaulieu-Jones, Li Anthony, Fatima Ashraf, Andrew Atz, Paul Avillach, Paula Azevedo, Antonio Bellasi, Vincent Benoit, Michele Beraghi, José Luis Bernal-Sobrino, Mélodie Bernaux, Romain Bey, Surbhi Bhatnagar, Alvar Blanco-Martínez, Martin Boeker, John Booth, Silvano Bosari, Robert Bradford, Gabriel Brat, Stéphane Bréant, Nicholas Brown, Raffaele Bruno, William Bryant, Mauro Bucalo, Emily Bucholz, Anita Burgun, Tianxi Cai, Aldo Carmona, Charlotte Caucheteux, Julien Champ, Krista Chen, Jin Chen, Lorenzo Chiudinelli, Kelly Cho, James Cimino, Tiago Colicchio, Sylvie Cormont, Sébastien Cossin, Jean Craig, Juan Luis Cruz-Bermúdez, Jaime Cruz-Rojo, Mohamad Daniar, Christel Daniel, Priyam Das, Batsal Devkota, Lana Garmire, Audrey Dionne, Rui Duan, Julien Dubiel, Loic Esteve, Hossein Estiri, Shirley Fan, Robert Follett, Thomas Ganslandt, Noelia García-Barrio, Nils Gehlenborg, Emily Getzen, Alon Geva, Tobias 
Gradinger, Alexandre Gramfort, Romain Griffier, Nicolas Griffon, Olivier Grisel, Alba Gutiérrez-Sacristán, Larry Han, David Hanauer, Christian Haverkamp, Daniel Key, Derek Hazard, Bing He, Darren Henderson, Martin Hilka, Kenneth Huling, Meghan Hutch, Richard Issitt, Anne Sophie Jannot, Vianney Jouhet, Ramakanth Kavuluru, Chris Kennedy, Kate Kernan, Katie Kirchoff, Jeffrey Klann, Isaac Kohane, Ian Krantz, Detlef Kraska, Ashok Krishnamurthy, Trang Le, Judith Leblanc, Guillaume Lemaitre, Leslie Lenert, Damien Leprovost, Molei Liu, Qi Long, Sara Lozano-Zahonero, Sadiqa Mahmood, Sarah Maidlow, Adeline Makoudjou, Anupama Maram, Patricia Martel, Marcelo Martins, Jayson Marwaha, Aaron Masino, Maria Mazzitelli, Arthur Mensch, Marianna Milano, Marcos Minicucci, Bertrand Moal, Taha Mohseni Ahooyi, Jason Moore, Cinta Moraleda, Jeffrey Morris, Karyn Moshal, Sajad Mousavi, Douglas Murad, Shawn Murphy, Thomas Naughton, Carlos Tadeu Breda Neto, Jane Newburger, Kee Yuan Ngiam, Wanjiku Njoroge, James Norman, Jihad Obeid, Marina Okoshi, Karen Olson, Gilbert Omenn, Nina Orlova, Brian Ostasiewski, Nathan Palmer, Nicolas Paris, Lav Patel, Miguel Pedrera-Jiménez, Ashley Pfaff, Emily Pfaff, Danielle Pillion, Sara Pizzimenti, Hans Prokosch, Robson Prudente, Andrea Prunotto, Víctor Quirós-González, Rachel Ramoni, Maryna Raskin, Siegbert Rieg, Gustavo Roig-Domínguez, Pablo Rojo, Paula Rubio-Mayo, Paolo Sacchi, Carlos Sáez, Elisa Salamanca, Malarkodi Jebathilagam Samayamuthu, L. 
Nelson Sanchez-Pinto, Arnaud Sandrin, Nandhini Santhanam, Janaina Santos, Fernando Sanz Vidorreta, Maria Savino, Juergen Schuettler, Luigia Scudeller, Neil Sebire, Pablo Serrano-Balazote, Patricia Serre, Arnaud Serret-Larmande, Mohsin Shah, Zahra Shakeri Hossein Abad, Domenick Silvio, Piotr Sliz, Jiyeon Son, Charles Sonday, Andrew South, Francesca Sperotto, Zachary Strasser, Amelia Tan, Bryce Tan, Suzana Tanni, Deanne Taylor, Ana Terriza-Torres, Patric Tippmann, Emma Toh, Yi-Ju Tseng, Andrew Vallejos, Gael Varoquaux, Margaret Vella, Guillaume Verdy, Jill-Jênn Vie, Shyam Visweswaran, Michele Vitacca, Kavishwar Wagholikar, Lemuel Waitman, Demian Wassermann, Griffin Weber, Martin Wolkewitz, Scott Wong, Zongqi Xia, Xin Xiong, Ye Ye, Nadir Yehya, William Yuan, Alberto Zambelli, Harrison Zhang, Daniela Zöller, Valentina Zuccaro, Chiara Zucco, Isaac Kohane, Tianxi Cai, Gabriel Brat
Article
npj Digital Medicine, 2022, 5 (1), pp.74. ⟨10.1038/s41746-022-00601-0⟩
Title
External validation of prognostic scores for COVID-19: a multicenter cohort study of patients hospitalized in Greater Paris University Hospitals
Authors
Yannis Lombardi, Loris Azoyan, Piotr Szychowiak, Ali Bellamine, Guillaume Lemaitre, Mélodie Bernaux, Christel Daniel, Judith Leblanc, Quentin Riller, Olivier Steichen, Pierre-Yves Ancel, Alain Bauchet, Nathanael Beeker, Vincent Benoit, Romain Bey, Aurélie Bourmaud, Stéphane Bréant, Anita Burgun, Fabrice Carrat, Charlotte Caucheteux, Julien Champ, Sylvie Cormont, Julien Dubiel, Catherine Duclos, Loic Esteve, Marie Frank, Nicolas Garcelon, Alexandre Gramfort, Nicolas Griffon, Olivier Grisel, Martin Guilbaud, Claire Hassen-Khodja, François Hemery, Martin Hilka, Anne Sophie Jannot, Jerome Lambert, Richard Layese, Léo Lebouter, Damien Leprovost, Ivan Lerner, Kankoe Levi Sallah, Aurélien Maire, Marie-France Mamzer, Patricia Martel, Arthur Mensch, Thomas Moreau, Antoine Neuraz, Nina Orlova, Nicolas Paris, Bastien Rance, Hélène Ravera, Antoine Rozes, Pierre Rufat, Elisa Salamanca, Arnaud Sandrin, Patricia Serre, Xavier Tannier, Jean-Marc Treluyer, Damien van Gysel, Gael Varoquaux, Jill-Jênn Vie, Maxime Wack, Perceval Wajsburt, Demian Wassermann, Eric Zapletal
Article
Intensive Care Medicine, 2021, 47 (12), pp.1426-1439. ⟨10.1007/s00134-021-06524-w⟩
Full text
https://inria.hal.science/hal-03967472/file/s00134-021-06524-w.pdf
Title
International electronic health record-derived COVID-19 clinical course profiles: the 4CE consortium
Authors
Gabriel A. Brat, Griffin M. Weber, Nils Gehlenborg, Paul Avillach, Nathan P. Palmer, Luca Chiovato, James Cimino, Brett K. Beaulieu-Jones, Sehi L’Yi, Mark S. Keller, Douglas S. Bell, Robert W. Follett, Lav P. Patel, Anne Sophie Jannot, Lemuel R. Waitman, Gilbert Omenn, Alberto Malovini, Jason H. Moore, Valentina Tibollo, Shawn N Murphy, Riccardo Bellazzi, David A Hanauer, Arnaud Serret-Larmande, Alba Gutierrez-Sacristan, John J Holmes, Douglas Bell, Kenneth D. Mandl, Jeffrey G Klann, Douglas A Murad, Luigia Scudeller, Mauro Bucalo, Katie Kirchoff, Jean Craig, Jihad Obeid, Vianney Jouhet, Romain Griffier, Sébastien Cossin, Bertrand Moal, Antonio Bellasi, Hans U Prokosch, Detlef Kraska, Piotr Sliz, Amelia L.M. Tan, Kee Yuan Ngiam, Alberto Zambelli, Danielle L Mowery, Emily Schiver, Batsal Devkota, Robert Bradford, Mohamad Daniar, Christel Daniel, Vincent Benoit, Romain Bey, Nicolas Paris, Patricia Serre, Nina Orlova, Julien Dubiel, Martin Hilka, Stephane Breant, Judith Leblanc, Nicolas Griffon, Anita Burgun, Melodie Bernaux, Arnaud Sandrin, Elisa Salamanca, Sylvie Cormont, Thomas Ganslandt, Tobias Gradinger, Julien Champ, Martin Boeker, Patricia Martel, Loïc Estève, Alexandre Gramfort, Olivier Grisel, Damien Leprovost, Thomas Moreau, Gael Varoquaux, Jill-Jênn Vie, Demian Wassermann, Arthur Mensch, Charlotte Caucheteux, Christian Haverkamp, Guillaume Lemaître, Silvano Bosari, Andrew South, Tianxi Cai, Isaac Kohane
Article
npj Digital Medicine, 2020, 3 (1), pp.109. ⟨10.1038/s41746-020-00308-0⟩
Full text
https://hal.science/hal-02918344/file/covid_ehr.pdf
Title
Hydroxychloroquine with or without azithromycin and in-hospital mortality or discharge in patients hospitalized for COVID-19 infection: a cohort study of 4,642 in-patients in France
Authors
Emilie Sbidian, Julie Josse, Guillaume Lemaître, Imke Mayer, Melodie Bernaux, Alexandre Gramfort, Nathanaël Lapidus, Nicolas Paris, Antoine Neuraz, Ivan Lerner, Nicolas Garcelon, Bastien Rance, Olivier Grisel, Thomas Moreau, Ali Bellamine, Pierre Wolkenstein, Gaël Varoquaux, Eric Caumes, Marc Lavielle, Armand Mekontso Dessap, Etienne Audureau
Article
2020