2021 M2 Internship: Study and performance benchmarking of Collaborative Edge systems

2021 — M2 Internship: Study and performance benchmarking of Collaborative Edge systems

Background

Large-scale web applications today are built on top of high-performance
Cloud servers hosting a distributed database. Geo-replication in Cloud
datacenters is used to avoid network latency and provide fast response
time. Nevertheless, the closest DC is often still too far away for an
optimal user experience. To remain available at all times, client-side
applications need to cache data at client machines, caching data at
client machines can improve availability and latency for many
applications, and also allow for temporary disconnection. This approach
is used in many recent cloud services, like Google Drive RT API or
Facebook News Feed, where developers implement caching and buffering at
application level, but it doesn’t ensure system-wide consistency
guarantees.

Many prior work efforts have studied data management in settings where
clients are intermittently connected to servers or to peers. Bayou [6]
pushed data replicas to the edge in the context of mobile environments,
then Cimbiosys [7] extended the decentralized synchronization model to
Internet Services, in addition to Rover and Coda [8], those systems
supports disconnected operations but rely on a weak consistency model.

Recently, Edge systems focuses around the direct interactions among web
application users. Parse [9], YJS and Cloud Types [10] are programming
models for shared cloud data, they allow local data copies to be stored
on the edge client and later be synced with the cloud, but provides only
an eventual consistency model. To guarantee that all replicas converge
to the same state despite concurrent updates SwiftCloud [11], Collab,
Legion [5] and Automerge [12] relies on Conflict-free Replicated Data
Types (CRDTs) [3].

To address those issues, our team developed Antidote and
EdgeAnt. Antidote is a data store that provides an adequate consistency
semantics with optimal performance by minimizing the need for
synchronization between storage replicas. It offers a causally
consistent transactional API and a toolkit of convergent data types that
accommodates the typical needs of distributed applications. EdgeAnt
extends Antidote with a consistent, mutable cache on the Edge device,
with the same API and consistency guarantees. With support for client
migration, P2P group communication and load placement at the edge or at
the datacenter.

Research objectives and methods

The aim of this project is to do a state of the art study of a five selected Edge-collaborative systems, in terms of features, implementation and standard performance benchmarking. We break down the project into the following steps:

  1. Brief research and implementation study of the state of the art
    selected systems.
  2. Adaptating an existing real-world collaborative benchmark application to the API of
    each system.
  3. Benchmark the five systems on a geo-replicated infrastructure (G5000
    or AWS).
  4. Interesting results can lead to a research publication.

How to apply

The intern must:

  • Be enrolled in Computer Science / Informatics or a related field.
  • Have an excellent academic record.
  • Be strongly interested in, and have good knowledge of, distributed systems and
    cloud platforms.
  • Be motivated by experimental research.

The internship is funded, and will take place in the Delys group, at Laboratoire d’Informatique de Paris-6 (LIP6), in Paris. It will be advised by Ilyas Toumlilt and supervised by Dr. Marc Shapiro. A successful intern will be invited to apply for a PhD.

To apply, contact Ilyas Toumlilt ilyas.toumlilt@lip6.fr with the following information:

  • A resume or Curriculum Vitæ.
  • A list of courses and grades of the last two years of study (an informal transcript is
    OK).
  • Names and contact details of two references (people who can recommend you),
    whom we will contact directly.

Bibliography

[1] AntidoteDB. http://syncfree.github.io/antidote/.

[2] D. Terry. “Replicated Data Consistency Explained Through Baseball” Communications of the ACM Vol. 56 N. 12, 2013. https://cacm.acm.org/magazines/2013/12/169945-replicated-data-consistency-explained-through-baseball/

[3] M. Shapiro, N. Preguiça, C. Baquero, and M. Zawirski. Conflict-free replicated data types. In Int. Symp. on Stabilization, Safety, and Security of Distributed Systems (SSS),
2011. http://www.springerlink.com/content/3rg39l2287330370/.

[4] D. D. Akkoorath, A. Z. Tomsic, M. Bravo, Z. Li, T. Crain, A. Bieniusa, N. Preguiça, and M. Shapiro.
“Cure: Strong semantics meets high availability and low latency.” In Int. Conf. on Distributed Comp. Sys. (ICDCS),
2016. http://doi.ieeecomputersociety.org/10.1109/ICDCS.2016.98.

[5] A. van der Linde, P. Fouto, J. Leitão, N. Preguiça, S. Castiñeira and A. Bieniusa. Legion: Enriching Internet Services with Peer-to-Peer Interactions, Proceedings of the 26th International Conference on World Wide Web, WWW 2017, Perth, Australia, April 3-7, 2017. http://dx.doi.org/10.1145/3038912.3052673

[6] Terry, Douglas B., et al. “Managing update conflicts in Bayou, a weakly connected replicated storage system.” ACM SIGOPS Operating Systems Review 29.5 (1995): 172-182.

[7] Ramasubramanian, Venugopalan, et al. “Cimbiosys: A platform for content-based partial replication.” Proceedings of the 6th USENIX symposium on Networked systems design and implementation. 2009.

[8] Satyanarayanan, Mahadev, et al. “Coda: A highly available file system for a distributed workstation environment.” IEEE Transactions on computers 39.4 (1990): 447-459.

[9] https://parseplatform.org/

[10] https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/final-with-color.pdf

[11]
M. Zawirski, N. Preguiça, S. Duarte, A. Bieniusa, V. Balegas, M. Shapiro.
“Write Fast, Read in the Past: Causal Consistency for Client-side Applications.”
Annual
ACM/IFIP/USENIX Middleware conference
, Dec. 2015.
Vancouver, BC, Canada.
http://dx.doi.org/10.1145/2814576.2814733.

[12] Kleppmann, Martin, and Alastair R. Beresford. “Automerge: Realtime data sync between edge devices.” 1st UK Mobile, Wearable and Ubiquitous Systems Research Symposium (MobiUK 2018). https://mobiuk.org/abstract/S4-P5-Kleppmann-Automerge. pdf. 2018.

Comments are closed.