This page presents a partial list of the projects in which I participated or which I coordinated.
- Current projects
- Past projects
ANR project ASTER
Coordinators: (General) Hélène Touzet, Bonsai team, CRIStAL-INRIA, (in Lille) David Hot, Institut Pasteur de Lille,(in Lyon) Vincent Lacroix BAOBAB-ERABLE Teams LBBE-UCBL-INRIA, (in Paris) Jean-Marc Aury, CEA.
The ANR project ASTER proposes to develop algorithms and software for analysing third generation sequencing data. Third generation is an emerging technology for RNA and DNA sequencing that promises to give a better picture for studying transcriptomes, metagenomes and metatranscriptomes of all living organisms. It will be key for discovering new fundamental mechanisms in cell biology, with broad implications in environmental research, health and agriculture. However, analysing the data is computationally challenging due to a very high rate of sequencing errors. There is a pressing need for models and algorithms that can accommodate this new kind of data and that are also scalable.
Stic AmSud project MAIA
Coordinators: (France) Marie-France Sagot, ERABLE Team, Inria; (Brazil) Roberto Marcondes César Jr, Instituto de Matemática e Estatística, Universidade de São Paulo; and Paulo Vieira Milreu, TecSinapse; (Chile) Vicente Acuña, Centro de Modelamiento Matemático, Santiago; and Gonzalo Ruz, University Adolfo Ibañez, Santiago.
This project has two main goals: one methodological that aims to explore how accurately hard problems can be solved theoretically by different approaches – exact, approximate, randomised, heuristic – and combinations thereof, and a second that aims to better understand the extent and the role of interspecific interactions in all main life processes by using the methodological insights gained in the first goal and the algorithms developed as a consequence.
Inria Associated Team ALEGRIA
Coordinator: (France) ERABLE Team, Inria and (Brazil) Instituto de Biologia Molecular do Paraná – Fiocruz-PR, Paraná.
Parasitic protists include agents of human and animal diseases that have a huge impact on world populations and economy. The major public health problems of protozoan organisms come mainly from the phylum Apicomplexa or the Class Kinetoplastida (from the phylum Euglenozoa).
An important subject yet largely under-explored is the fact that most members from these groups are pathogenic while a small fraction is not, which raises the question of what gives origin to the pathogenicity of these parasites. This is the main question we wish to address by means of computational methods and wet-lab experiments.
H2020-MSCA-ETN-2014 project MicroWine
Coordinator: Lars Hestbjerg Hansen, Department of Environmental Science – Enviromental microbiology & biotechnology, Aarhus University, Aarhus, Danemark.
A diverse, complex, and poorly characterised community of microorganisms lies at the heart of the wine. These microorganisms play key roles at all stages of the viniculture and vinification processes, from helping the plants access nutrients from the soil, driving the plants’ health through protection against pathogens, to the fermentation process that transforms the must into wine with its complex array of aromas and flavours.
The main aim of MicroWine is to gain an improved understanding of such microbial community and of its interplay with the wine.
CNRS-UCBL-INRIA International Associated Laboratory LIRIO
Coordinators: (France) ERABLE-BAOBAB LBBE and (Brazil) Labinfo LNCC
The CNRS-UCBL-INRIA International Associated Laboratory (Laboratoire International Associé – LIA) LIRIO builds upon a strong collaboration between the team of a French-Brazilian researcher with a background in discrete mathematics and algorithmics for the life sciences who has made her scientific career in France, since 2001 in the Laboratoire de Biométrie et Biologie Évolutive UMR 5558, and the team of a Brazilian researcher with a background in genetics and bioinformatics, and extensive national and international links in the area of bioinformatics. The research that will be conducted in the LIA will concern putting together all the activities currently conducted by each team separately or that each team has already planned to do, but also new research that the synergy between the two teams will enable to address in future. This synergy represented by the LIA should also allow us to apply for other sources of funding to support the research we wish to develop. Initially, this research will be concentrated on two main axes, one strongly concerned with the host-parasite relationship and the second with micro-environmental genomics and systems biology. Both address complex systems by a broad variety of experimental, bioinformatic and algorithmic approaches that reflect the complementarity of the two teams involved (biology including experimental part for the Brazilian team, algorithmics for the French one) while bioinformatics is a common language between the two. Besides fundamental issues, the two axes may have also important health-related implications. The topics in these two axes belong to one of the five “thématiques au cœur de l’INEE”, namely “Biodiversité et écologie fonctionnelle”, and cover three “thèmes d‘interface”, namely “Biodiversité, structure, dynamique et fonctionnalité”, “Mécanismes d’adaptation et d’évolution” and “Environment et santé”. Training will represent another key aspect of the LIA, and will aim at extending the already intensive exchanges of researchers, Master and PhD students between the two French and Brazilian partners of the LIA. The bioinformatic aspect of the two axes of research, both sequencing and data analysis, will also greatly benefit from an interaction between the platforms with which each partner is involved in her own country.
Associated with LIRIO, there are also a number of projects whose description may be found here.
This list is incomplete. It contains only the projects that started from 2009.
Coordinator: BAMBOO (sole partner)
ERC Advanced Grant SISYPHE
Coordinator: BAMBOO that then became ERABLE (sole partner)
Symbiosis, or at least its extent, role and precise nature are controversial but symbiosis appears also essential to understand some of the most fundamental evolutionary and functional questions related to living organisms. Nevertheless, although symbiotic relationships have been studied by biologists since probably the early 19th century, they remain little studied by computational biologists. Yet the enormous variety in the observed types of pair- and multi-wise symbiotic relations, and the fact that these relationships touch upon almost every aspect of biology, from molecular to ecological, raise formidable mathematical and computational issues. Addressing some of the main such issues to arrive at a better understanding of the processes of “acquisition and maintenance of one or more organisms by another” and of the (co)evolution of “novel structures and metabolism” with which such processes are associated is the purpose of this project. The approach proposed blends a mathematical (combinatorial, statistical) exploration of the huge variety of genomic and biochemical landscapes observed in the symbiont world, and at the interface between symbionts and hosts or of both and their environment, together with wet-lab experiments.
INRIA International Partnership AMICI
Coordinators: (France) BAMBOO INRIA, (Italy) Universities of Florence, Pisa and Rome La Sapienza, (Netherlands) Free University and CWI Amsterdam
The Inria International partnership AMICI followed up from the INRIA Associated Team SIMBIOSI that started in January of 2009 and ended in December of 2011. The three coordinators of SIMBIOSI were Marie-France Sagot for the INRIA, Alberto Marchetti-Spaccamela (University of Rome La Sapienza) and Leen Stougie (Free University of Amsterdam and CWI).
The members of AMICI were the three above, together with the whole of BAMBOO, plus:
- Nadia Pisanti and Roberto Grossi from the University of Pisa, Italy, with whom we have been collaborating since the PhD of Nadia Pisanti in 2002′
- Pierluigi Crescenzi from the University of Florence, Italy, who joined the Associated Team SIMBIOSI as collaborator early on in its creation (in mid-2009). He is also since March 2011 co-supervisor of two PhD students (Gustavo Sacomoto and Beatrice Donati) with Marie-France Sagot (PhDs funded by the ERC AdG Sisyphe).
BAMBOO proposed an evolution of AMICI towards another structure, that of a European Inria Project-Team called ERABLE currently under way. ERABLE exists already since January 1st, 2015, as an Inria “Center-Team”.
Coordinator: (General) Pierre Peterlongo GenScale INRIA, (in Lyon) Vincent Lacroix BAMBOO INRIA-LBBE-UCBL, (in Montpellier) Eric Rivals LIRMM
The main goal of the Colib’read was to design new algorithms dedicated to the extraction of biological knowledge from raw (non assembled) data produced by High Throughput Sequencers (HTS), also called Next Generation Sequencers (NGS).
A few years ago, genomics witnessed an unprecedentedly deep change with the advent of High Throughput Sequencing (HTS), also known as Next Generation Sequencing (NGS). These technologies generate data of a new type in huge volumes. Crucial computational developments are needed to take full advantage of these data. Our project proposed an original way of extracting information from such data. Usually, a generic assembly (pretreatment) is applied to the data, and then, in a second step, any information of interest is extracted. Our aim was to avoid this protocol that leads to a significant loss of information, or generates chimerical results because of the heuristics used in the assembly. Instead, we developed a set of innovative methods for extracting information of biological interest from HTS data that bypass any costly and often inaccurate assembly step. Importantly, the developed methods do not require the availability of a reference genome. This broadened considerably the spectrum of applications of our methods. Shortly, for each biological question, our general approach consisted in 1) defining a model for the searched elements; 2) detecting in one or several HTS datasets those elements that fit the model; 3) outputting those together with a score and their genomic neighbourhood. From a computational viewpoint, our proposal relied on a formal model based on the De-Bruijn graph structure to develop algorithms able to handle a huge amount of data. Among others, Colib’read delivered algorithms based on the De-Bruijn graph, and tools validated by biologists.
This project was at the interface between (i) fundamental computational questions, (ii) algorithmic developments including the design of ad-hoc indexes and parallelisation, and (iii) biological applications for validation. Finally (iv) it also proposed a large public and educational dissemination.
More information on it may be found here.
FP7 KBBE BacHBerry
The FP7 KBBE BacHBerry project (“BACterial Hosts for production of Bioactive phenolics from bERRY fruits”) started in November 2013 and ended in October 2016.
Coordinator: Jochen Förster, Novo Nordisk Foundation Center for Biosustainability (CFB), Copenhagen, Danemark
The main objective of BacHBerry was to develop innovative methodologies for tapping the commercial potential of plant metabolites, namely phenolic compounds in berry fruits, overcoming current scientific and technological barriers in the field of bio-industry, for the generation of bacterial platforms for sustainable, bio-based production of the desired plant metabolites.
Plants synthesize a staggering variety of secondary metabolites, and this chemodiversity is a poorly used pool of natural molecules with bioactive properties of importance for applications in the pharma and food industries. BacHBerry focused on phenolic compounds, a large and diverse class of plant metabolites, which are currently in the spotlight due to their claimed beneficial effects in prevention and treatment of chronic diseases, but that also have applications as cosmetics, flavours and food colorants etc. Berries are soft and colourful fruits, with great diversity, high content and unique profiles in phenolic compounds, making them a major source of these high-value metabolites. The BacHBerry project aimed to develop a portfolio of sustainable methodologies to mine the potential of the untapped biodiversity of the bioactive phenolic compounds in an extensive collection of berry species. Full exploitation of this unrivalled natural resource requires an integrated and comprehensive effort from bioprospecting in berries using SMART high-throughput screens for the valorisation of phenolic bioactivities aligned with their identification using cutting edge analytics and subsequent elucidation of their biosynthetic pathways. This knowledge will facilitate metabolic engineering of suitable bacterial hosts for high-value phenolics production in scalable fermentation bioprocesses, ultimately serving as commercial production platforms. The consortium comprised a full chain of research and innovation, with 12 research groups, 5 SMEs and a large enterprise, representing 10 countries including partners from ICPC countries Russia, Chile and China, with the capacity to exploit novel bioactivities from berry fruits diversity. BacHBerry developed a pipeline of sustainable and cost-effective processes to facilitate production of added-value berry phenolics with immediate potential for commercialisation and consequent socio-economic benefits for the European community and beyond.
For more information on BacHBerry, see http://www.bachberry.eu/.