Melanie Herschel: The Nautilus Analyzer – Understanding and Debugging Data Transformations

14.00, room 445, PCRI

When developing data transformations – a task omnipresent in applications like data integration, data migration, data cleaning, or scientific data processing – developers quickly face the need to verify the semantic correctness of the transformation.
Declarative specifications of data transformations, e.g., SQL or ETL~tools, increase developer productivity but usually provide limited or no means for inspection or debugging. In this situation, developers today have no choice but to manually analyze the transformation and, in case of an error, to (repeatedly) fix and test the transformation.

The goal of the Nautilus project is to semi-automatically support this analysis-fix-test cycle. This talk and demonstration focus on one main component of Nautilus, namely the Nautilus Analyzer that helps developers in understanding and debugging their data transformations. After a brief introduction to different algorithms implemented within Nautilus, the demonstration will show the capabilities of this component for data transformations specified in SQL on scenarios from different domains that are based on real-world data.

Permanent link to this article: