Research

Our approach is to capitalize on the principles of distributed and parallel data management. In particular, we exploit: high-level languages as the basis for data independence and automatic optimization; data semantics to improve information retrieval and automate data integration; declarative languages (algebra, calculus) to manipulate data and workflows; and highly distributed and parallel environments such as P2P, cluster and cloud. To reflect our approach, we organize our research program in four complementary themes:

  1. Data integration, including data capture and cleaning;
  2. Query processing, including indexing and privacy;
  3. Scientific workflows, in particular, in grid and cloud;
  4.  Data analytics, including data mining and statistics;
  5. Machine learning for high-dimensional data processing and search.

Key-words

  • Data science, big data, scientific data
  • Cluster, cloud, peer to peer
  • Distributed and parallel data management, data integration,data privacy,  data analytics, machine learning, data search, content-based image retrieval

Activity reports

Permanent link to this article: https://team.inria.fr/zenith/research/