Yannis Papakonstantinou will give a talk this Monday at 11 am, Turing Building, Flowers room.
The SQL++ Query Language: Support for native JSON, while backwards-compatible with SQL
Yannis Papakonstantinou, Professor of Computer Science and Engineering, UCSD
SQL-on-Hadoop, NewSQL and NoSQL databases provide semi-structured data models (typically JSON-based) and are now moving toward declarative, SQL-like query languages. However, their idiomatic, non-SQL language constructs, their many variations and their lack of formal syntax and semantics pose problems. Notably, database vendors end up with unclear semantics and complicated implementations as they add one feature at a time.
The presented SQL++ semi-structured data model bridges JSON and the SQL data model. The SQL++ query language is backwards-compatible with SQL while supporting native JSON. We show that a relatively small set of SQL restriction removals and feature additions is enough to provide a SQL-compatible extension for semi-structured data. SQL++ is currently being adopted by industry.
The extension to Configurable SQL++ adds configuration options that describe alternative language semantics and formally capture the variations among existing database languages. Configurable SQL++ is unifying: by appropriate choices of configuration options, its semantics can morph into the semantics of any of the eleven popular semi-structured databases that we surveyed, as the experimental validation shows. In this way, Configurable SQL++ allows a formal characterization of the capabilities of the emerging query languages.
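To make the idea of a configuration option concrete, here is a minimal Python sketch of one way a single option could change query semantics. It is an illustration only, not the SQL++ definition: the option name `absent_attribute`, its three values, and the helper functions are all hypothetical, chosen to mimic the common point of disagreement among databases about what navigating to an absent attribute should return.

```python
from dataclasses import dataclass

# Sentinel distinct from None, so "attribute absent" and "attribute is null"
# can be told apart.
MISSING = object()


@dataclass(frozen=True)
class Config:
    # Hypothetical option (not an actual SQL++ option name): the result of
    # navigating to an attribute the record does not have.
    # One of "null", "missing", "error".
    absent_attribute: str = "missing"


def navigate(record: dict, attr: str, config: Config):
    """Return record[attr], or apply the configured behavior if it is absent."""
    if attr in record:
        return record[attr]
    if config.absent_attribute == "null":
        return None
    if config.absent_attribute == "missing":
        return MISSING
    if config.absent_attribute == "error":
        raise KeyError(attr)
    raise ValueError(f"unknown option value: {config.absent_attribute!r}")


def select_attr(records, attr, config):
    """Project one attribute from each record, dropping MISSING results."""
    out = []
    for record in records:
        value = navigate(record, attr, config)
        if value is not MISSING:
            out.append(value)
    return out


# Made-up sample data: the second record has no "co" attribute.
readings = [{"sensor": 1, "co": 0.4}, {"sensor": 2}]
```

With this sketch, `select_attr(readings, "co", Config("null"))` keeps a null for the second record, `Config("missing")` silently drops it, and `Config("error")` raises. The same query therefore yields three different behaviors depending on one setting, which is the kind of variation the abstract says the configuration options are meant to capture.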
Yannis Papakonstantinou is a Professor of Computer Science and Engineering at the University of California, San Diego. His research is at the intersection of data management technologies and the web, where he has published over ninety-five research articles and received over 13,000 citations. He has given multiple tutorials and invited talks, has served on journal editorial boards, and has chaired and participated in program committees for many international conferences and workshops. He also teaches for UCSD’s Master of Advanced Studies in Data Science.