Valda Seminar: Pierre Senellart

7 February 2025, 11:00-12:00.

ENS, room S16/FS101 (Aile Rataud, level -1).

ProvSQL: A General System for Keeping Track of the Provenance and Probability of Data

We present the data model, design choices, and performance of ProvSQL, a general and easy-to-deploy provenance tracking and probabilistic database system implemented as a PostgreSQL extension. ProvSQL’s data and query models closely reflect those of a large core of SQL, including multiset semantics, the full relational algebra, and terminal aggregation. A key part of its implementation relies on generic provenance circuits stored in memory-mapped files. We propose benchmarks to measure the overhead of provenance and probabilistic evaluation, and demonstrate ProvSQL’s scalability and competitiveness with respect to other state-of-the-art systems.
