BlobSeer is a large-scale distributed storage service that addresses advanced data management requirements resulting from ever-increasing data sizes. It is centered around the idea of leveraging versioning for concurrent manipulation of binary large objects in order to efficiently exploit data-level parallelism and sustain a high throughput despite massively parallel data access.

Features include:

  • support to store large binary large objects (BLOBs) that reach the order of TB
  • fine grain access (e.g., in the order of MB)
  • versioning: each write generates a new fully independent snapshot of the blob; all past snapshots are accessible
  • data and metadata decentralization
  • high throughput under heavy access concurrency in any combination: read/read, read/write, write/write.
BlobSeer website