PhD defense of Federico Ulliana

14.00, room 455, PCRI

Title: “Types for Detecting XML Query-Update Independence”

Abstract:

In the last decade XML became one of the main standards for data storage and exchange on the Web. Detecting XML query-update independence is crucial to efficiently perform data management tasks, like those concerning view-maintenance, concurrency control, and security. This thesis presents a novel static analysis technique to
detect XML query-update independence, in the presence of a schema. Rather than types, the presented system infers chains of types. Each chain represents a path that can be traversed on a valid document during query/update evaluation. The resulting independence analysis is precise, although it raises a challenging issue: recursive schemas may lead to infer infinitely many chains. This thesis presents a sound and complete approximation technique ensuring a finite analysis in any case, together with an efficient implementation performing the chain-based analysis in polynomial space and time.

Permanent link to this article: https://team.inria.fr/oak/2012/12/12/phd-defense-of-federico-ulliana/

Amélie Gheerbrant: Pattern-based XML queries

14.00, room 445, PCRI

Abstract
We present results of our study of pattern-based XML query languages. A variety of analogs of conjunctive queries over XML documents have been studied in the literature. We focus on languages based on tree patterns, which follow the structure of tree documents. They arise naturally in many problems related to integrating and exchanging data and to handling incomplete information. In this talk, we concentrate on the problems of finding certain answers to queries, and on static analysis, in particular, query containment.

The problem of computing certain answers arises when one queries incompletely specified databases, i.e., databases with missing information. As often happens, the complexity of the problem jumps when one moves from relations to XML. Nonetheless, we identified large relevant classes of queries for which efficient algorithms can be developed. Curiously, for some of these classes, no analogous results existed in the relational world. In fact, we uncovered a well-behaved class of relational queries which had been overlooked so far.

Testing for query containment is a fondamental task in query optimization. In the relational case, classical homomorphism based tools lead to reasonable complexity algorithms. In XML such techniques can be applied only for very simple queries. Beyond thoses classes of queries, they can either be adapted by using more sophisticated data structures, or – provably – they cannot be adapted at all. In addition, we look at the containment problem in the presence of extra features (such as schemas for documents).

The talk will be based on papers from ICDT’12 and ICDT’13.

Short bio
Amélie Gheerbrant did a PhD about fixed-point logics on trees in Amsterdam with Johan van Benthem and Balder ten Cate. She is currently doing a postdoc in Edinburgh with Leonid Libkin. Her main research interests are in the areas of data management and applications of logic in computer science.

Slides
Amelie Gheerbrant – Pattern-based XML queries

Permanent link to this article: https://team.inria.fr/oak/2012/12/07/amelie-gheerbrant-pattern-based-xml-queries/

ISI: Des triplets sur des arbres.

Des triplets sur des arbres. Un modèle hybride XML-RDF pour documents annotés
by François Goasdoué, Konstantinos Karanasos, Yannis Katsis, Julien Leblay, Ioana Manolescu, Stamatis Zampetakis
In Ingénierie des Systèmes d’Information – Vol 17/5 – 2012 – pp.87-111

Permanent link to this article: https://team.inria.fr/oak/2012/12/05/isi-des-triplets-sur-des-arbres/

ICDE 2013: Triples in the clouds

Triples in the clouds
by Zoi Kaoudi and Ioana Manolescu
Tutorial in IEEE ICDE 2013

Permanent link to this article: https://team.inria.fr/oak/2012/11/06/icde-2013-triples-in-the-clouds/

Oak team at CIKM 2012

The conference took place in Maui, Hawaii, USA from October 29 to 2nd November. The team has presented two demo papers and a workshop paper.

  • “AMADA: Web Data Repositories in the Amazon Cloud” by Andrés Aranda-Andújar, Francesca Bugiotti, Jesús Camacho-Rodríguez, Dario Colazzo, Francois Goasdoué, Zoi Kaoudi and Ioana Manolescu (demo paper).
  • “The Nautilus Analyzer: Understanding and Debugging Data Transformations” by Melanie Herschel and Hanno Eichelberger (demo paper).
  • “Data Bridges: Data Integration for Digital Cities” by Melanie Herschel and Ioana Manolescu, in the City Data Management Workshop (CDMW).

Permanent link to this article: https://team.inria.fr/oak/2012/11/06/oak-team-at-cikm-2012/

Evaggelia Pitoura: Diversifying query results and answering historical queries on large graphs

14.00, room 445, PCRI

Abstract
In this talk, I shall briefly present our recent research on two topics: (a) supporting diversity in query results and (b) managing historical queries on graphs. Diversity has recently attracted a lot of attention as a means of increasing user satisfaction in recommendation systems, information retrieval and database queries. Diversification takes many forms such as increasing novelty and covering different query aspects and user information needs. In our recent research, we have proposed efficient indexing techniques for supporting continuous diversification as well as a new definition of diversity based on graph theory that allows adapting the degree of diversification. In the second part of the talk, I shall touch upon our new work on supporting historical queries on large graphs (e.g., on social networks).

Short bio
Evaggelia Pitoura is an Associate Professor and Vice Chair of the Department of Computer Science, University of Ioannina, Greece where she also leads the Distributed Management of Data Lab. She holds a PhD and an MSc degree in Computer Science from Purdue University. Her research interests are in distributed data management, with a recent focus on social networks and ranking by using preferences and diversity. Her publications include more than 150 articles in international journals and conferences and a highly-cited book on mobile computing. She is the recipient of a Best Paper Award (ICDE 1999), a Marie Currie Fellowship (2009) and two ACM Recognition of Service Awards. She regularly serves on the PC of many highly ranked database conferences and currently as demo PC co-chair for VLDB 2012 and PC co-chair for ICDE 2012. Her research has been supported by national and European funding agencies.

Permanent link to this article: https://team.inria.fr/oak/2012/11/02/evaggelia-pitoura-diversifying-query-results-and-answering-historical-queries-on-large-graphs/

ACM TODS: Optimizing XML Querying using Type-based Document Projection

Optimizing XML Querying using Type-based Document Projection
by Veronique Benzaken, Giuseppe Castagna, Dario Colazzo and Kim Nguyen
in ACM Transactions on Database Systems (TODS)

Permanent link to this article: https://team.inria.fr/oak/2012/10/15/acm-tods-optimizing-xml-querying-using-type-based-document-projection/

The LRI’s Nao robot

The LRI has acquired a NAO robot from Aldebaran.

Here is a demo from today:

Jean-Christophe Souplet, who is the Nao handler for now, welcomes ideas of using it next to demos that are inspired from what the teams are doing. The Nao can be programmed with relatively low difficulty, using a range of pre-defined gestures; it has a voice synthesizer etc. Of course not everything makes sense, but it may be good to know that we can use it to “enhance demo user experience” – or for something more serious, also, of course.

Permanent link to this article: https://team.inria.fr/oak/2012/10/12/the-lris-nao-robot/

Ekaterina Tzompanaki and Stamatis Zampetakis join the team

Ekaterina and Stamatis have just arrived for their PhDs. Welcome!

Permanent link to this article: https://team.inria.fr/oak/2012/10/01/ekaterina-tzompanaki-and-stamatis-zampetakis-join-the-team/

PhD defense of Mohamed-Amine Baazizi

10.30, room 435, PCRI

Title: “Analyse statique pour l’optimisation des mises à jour de documents XML temporels”

Abstract:

The last decade has witnessed a rapid expansion of XML as a format for representing and exchanging data through the web.  In order to follow this evolution, many languages have been proposed to query, update or transform XML documents. At the same time, a range set of systems allowing to store and process XML documents have been developed. Among these systems, main-memory engines are lightweight systems that are the favoured choice for applications that do not require complex functionalities of traditional DBMS such as secondary storage indexes or transaction management. These engines require loading the documents to be processed entirely into main-memory. Consequently, they suffer from space limitations and are not able to process quite large documents.

In this thesis, we investigate issues related to the evolution of XML documents and to the management of the temporal dimension for XML. This thesis consists of two parts sharing the common goal of developing efficient techniques for processing large XML documents using main-memory engines. The first part investigates the optimization of update for static XML documents. We have developed a technique based on XML projection, a method that has been proposed to overcome the limitations of main-memory engines in the case of querying. We have devised for a new scenario for projection allowing the propagation of the updates effects.

The second part of the thesis investigates the issue of building and maintaining time-stamped XML documents under space limitations. Our contribution consists in two methods. The first method can be applied in the general case where no restriction is made on the evolution of the XML documents. This method is designed to be performed in streaming and allows thus processing large time-stamped  documents. The second method deals with the case where the changes are specified by updates. This method is based on the projection paradigm which allows it for processing large time-stamped  documents and for generating time-stamped documents which are satisfactory from the point of view of storage.

Permanent link to this article: https://team.inria.fr/oak/2012/09/07/phd-defense-of-mohamed-amine-baazizi/