Seminar by Patrick Valduriez “Innovation: startup strategies” 20 May 2021.

The 12th edition of the Marcus Evans “Innovation Strategies” conference will be held virtually from May 19 to 20, 2021.

https://drive.google.com/file/d/1HQ8AHzcPbK3eqap8hqGT43TGcHHYZSEt/preview

Check out my presentation on May 20 at 14h40.

Innovation: startup strategies
Patrick Valduriez
Inria and LIRMM, Univ. Montpellier, France

Technological innovation driven by startups is hard to formalize (and manage), as the context may be unknown or quickly changing. To be successful, the innovation process must involve not only inventions (new methods) but also context, e.g. user behavior, and timing, e.g. market readiness. In this talk, I illustrate various innovation strategies based on startup success stories, in particular LeanXcale, which delivers a new-generation HTAP DBMS product. I also give hints to promote innovation within startups.

Permanent link to this article: https://team.inria.fr/zenith/2597-2/

New video with Gaëtan Heidsieck, Esther Pacitti and François Tardieu “The design of digital agriculture” May 2021.

Permanent link to this article: https://team.inria.fr/zenith/new-video-with-gaetan-heidsieck-esther-pacitti-and-francois-tardieu-the-design-of-digital-agriculture-may-2021/

Esther Pacitti at Online Seminar of Instituto de Computação – UFF on 17 March 2021.

Esther Pacitti gave an invited lecture at Seminários 2021, Instituto de Computação – UFF, Rio de Janeiro, on “Uma Perspectiva Evolutiva e Multidisciplinar do Tratamento de Dados”.

This work is in the context of the HPDaSc Inria associated team.

Watch the video on YouTube.

Permanent link to this article: https://team.inria.fr/zenith/esther-pacitti-at-online-seminar-at-instituto-de-computacao-uff-on-17-march-2021/

Patrick Valduriez at (Online) University of Paris Seminar Series on Data Analytics on 25 February 2021.

Patrick Valduriez at (Online) University of Paris Seminar Series on Data Analytics, in collaboration with the diNo group, on 25 February 2021, 15h.

http://helios.mi.parisdescartes.fr/~themisp/seminars/2021-02-25-Valduriez.html

Distributed Database Systems: the case for NewSQL

NewSQL [Valduriez & Jimenez-Peris 2019] is the latest technology in the big data management landscape, enjoying fast growth in the DBMS and BI markets. NewSQL combines the scalability and availability of NoSQL with the consistency and usability of SQL. By blending capabilities previously available only in different kinds of database systems, such as fast data ingestion and SQL queries, and by providing online analytics over operational data, NewSQL opens up new opportunities in many application domains where real-time decision-making is critical. Important use cases include eAdvertisement (such as Google AdWords), IoT, performance monitoring, proximity marketing, risk monitoring, real-time pricing, and real-time fraud detection. NewSQL may also simplify data management by removing the traditional separation between NoSQL and SQL (ingest data fast, query it with SQL), as well as between operational databases and data warehouses / data lakes (no more ETLs!). However, a hard problem is scaling out transactions in mixed operational and analytical (HTAP) workloads over big data, possibly coming from different data stores (HDFS, SQL, NoSQL). Today, only a few NewSQL systems have solved this problem. In this talk, I introduce the solution for scalable transaction and polystore data management in LeanXcale, a recent NewSQL DBMS.

Permanent link to this article: https://team.inria.fr/zenith/patrick-valduriez-at-online-university-of-paris-seminar-series-on-data-analytics-on-25-february-2021-15h/

2020 Inria – Académie des sciences Innovation Prize for Pl@ntNet

This year, 2020, the Inria – Académie des sciences – Dassault Systèmes Innovation Prize is awarded to the interdisciplinary project Pl@ntNet (Alexis Joly et al.). The award crowns ten years of research in the service of biodiversity: this collaborative platform, based on deep learning, is now used by some ten million people to identify plants.

Permanent link to this article: https://team.inria.fr/zenith/prix-plantnet-2020/

Inria Brasil – the web site, 24 November 2020

The Inria Brasil web site is now open.

It reflects the collaboration between Inria and LNCC, the Brazilian National Scientific Computing Laboratory, and associated Brazilian universities in High Performance Computing, Artificial Intelligence, Data Science and Scientific Computing. The collaboration is headed by Frédéric Valentin (LNCC, Inria International Chair) and Patrick Valduriez.

Permanent link to this article: https://team.inria.fr/zenith/inria-brasil-the-web-site/

Patrick Valduriez at (Online) CWI lectures on Database Research, 19 Nov. 2020.

Patrick Valduriez on “Distributed Database Systems: the case for NewSQL” on 19 Nov. 2020 at (Online) CWI lectures on Database Research.

NewSQL [Valduriez & Jimenez-Peris 2019] is the latest technology in the big data management landscape, enjoying fast growth in the DBMS and BI markets. NewSQL combines the scalability and availability of NoSQL with the consistency and usability of SQL. By blending capabilities previously available only in different kinds of database systems, such as fast data ingestion and SQL queries, and by providing online analytics over operational data, NewSQL opens up new opportunities in many application domains where real-time decision-making is critical. Important use cases include eAdvertisement (such as Google AdWords), IoT, performance monitoring, proximity marketing, risk monitoring, real-time pricing, and real-time fraud detection. NewSQL may also simplify data management by removing the traditional separation between NoSQL and SQL (ingest data fast, query it with SQL), as well as between operational databases and data warehouses / data lakes (no more ETLs!). However, a hard problem is scaling out transactions in mixed operational and analytical (HTAP) workloads over big data, possibly coming from different data stores (HDFS, SQL, NoSQL). Today, only a few NewSQL systems have solved this problem. In this talk, I introduce the solution for scalable transaction and polystore data management in LeanXcale, a recent NewSQL DBMS.

Permanent link to this article: https://team.inria.fr/zenith/online-cwi-lectures-on-database-research/

RISC2: New European H2020 project (2021-2022) between Europe and Latin America in HPC

The RISC2 project is a coordination network for High Performance Computing (HPC) between Europe and Latin America, funded by the European H2020 FETHPC program and the partner countries. It is managed by the Barcelona Supercomputing Center and involves eight main European HPC actors, including three Inria teams (Nachos, Seism and Zenith) and Atos Bull, as well as the main HPC actors from Brazil (including LNCC), Mexico, Argentina, Colombia, Uruguay, Costa Rica and Chile.

Permanent link to this article: https://team.inria.fr/zenith/risc2-new-european-h2020-project-2021-2022-between-europe-and-latin-america-in-hpc/

DEXA 2020 best paper award by Gaëtan Heidsieck, Daniel de Oliveira, Esther Pacitti, Christophe Pradal, François Tardieu, and Patrick Valduriez

“Distributed Caching of Scientific Workflows in Multisite Cloud” by Gaëtan Heidsieck, Daniel de Oliveira, Esther Pacitti, Christophe Pradal, François Tardieu, and Patrick Valduriez received the best paper award at the 31st International Conference on Database and Expert Systems Applications (DEXA), Springer, Sep 2020. The work was done in collaboration with CIRAD and INRAe, in the context of the #Digitag project, and with Brazil in the context of the HPDaSc Inria associated team.

Permanent link to this article: https://team.inria.fr/zenith/dexa-2020-best-paper-award/

SBBD 2020 tutorial by Patrick Valduriez “Principles of Distributed Database Systems: spotlight on NewSQL” 29 September 2020.

Tutorial at SBBD 2020
https://sbbd.org.br/2020/tutorial-3/
29 September 2020, 14h-16h30

Principles of Distributed Database Systems: spotlight on NewSQL
Patrick Valduriez
Inria, University of Montpellier, CNRS, LIRMM, France
LeanXcale, Spain

The first edition of the book Principles of Distributed Database Systems, co-authored with Prof. Tamer Özsu (University of Waterloo), appeared in 1991, when the technology was new and there were few products. In the Preface to the first edition, we quoted Michael Stonebraker, who claimed in 1988 that in the following 10 years, centralized DBMSs would be an “antique curiosity” and most organizations would move towards distributed DBMSs. That prediction has certainly proved to be correct, and most systems in use today are either distributed or parallel.

The fourth edition of this classic textbook [Özsu & Valduriez 2020] provides major updates, in particular, new chapters on big data platforms, NoSQL, NewSQL and polystores. In this tutorial, we introduce these major updates, with a focus on NewSQL.

NewSQL is the latest technology in the big data management landscape, enjoying fast growth in the DBMS and BI markets. NewSQL combines the scalability and availability of NoSQL with the consistency and usability of SQL. By providing online analytics over operational data, NewSQL opens up new opportunities in many application domains where real-time decision-making is critical. Important use cases include eAdvertisement (such as Google AdWords), IoT, performance monitoring, proximity marketing, risk monitoring, real-time pricing, and real-time fraud detection. NewSQL may also simplify data management by removing the traditional separation between NoSQL and SQL (ingest data fast, query it with SQL), as well as between operational databases and data warehouses / data lakes (no more ETLs!). However, a hard problem is scaling out transactions in mixed operational and analytical (HTAP) workloads over big data, possibly coming from different data stores (HDFS, SQL, NoSQL). Today, only a few NewSQL systems have solved this problem.

A first in-depth presentation of NewSQL was given in a tutorial at IEEE Big Data 2019 with Prof. Ricardo Jimenez-Peris (CEO and founder at LeanXcale) [Valduriez & Jimenez-Peris 2019]. In this tutorial, we provide a taxonomy of NewSQL systems based on major dimensions, including targeted workloads, capabilities and implementation techniques. We illustrate with popular NewSQL systems such as Google Spanner, LeanXcale, CockroachDB, SAP HANA, MemSQL and Splice Machine, and spotlight some of the more advanced systems. We also compare with major NoSQL and SQL systems, and discuss integration within big data ecosystems and corporate information systems, using polystores. Finally, we discuss the current trends and research directions.

References

[Özsu & Valduriez 2020] Tamer Özsu, Patrick Valduriez. Principles of Distributed Database Systems, 4th Edition, Springer, 2020.

[Valduriez & Jimenez-Peris 2019] Patrick Valduriez, Ricardo Jimenez-Peris. NewSQL: principles, systems and current trends. IEEE Big Data Conference, Los Angeles, December 2019.

Permanent link to this article: https://team.inria.fr/zenith/2381-2/