7 releases

new 0.3.0 Nov 17, 2024
0.2.0 Sep 15, 2024
0.1.4 Aug 30, 2024

#18 in Database implementations

Download history 314/week @ 2024-08-10 161/week @ 2024-08-17 35/week @ 2024-08-24 174/week @ 2024-08-31 2/week @ 2024-09-07 219/week @ 2024-09-14 6/week @ 2024-09-21 100/week @ 2024-09-28 34/week @ 2024-10-05 14/week @ 2024-10-12 17/week @ 2024-10-19 241/week @ 2024-10-26 154/week @ 2024-11-02 397/week @ 2024-11-09 109/week @ 2024-11-16

901 downloads per month

Apache-2.0

700KB
16K SLoC

SlateDB

Crates.io Version GitHub License slatedb.io Discord Docs

Introduction

SlateDB is an embedded storage engine built as a log-structured merge-tree. Unlike traditional LSM-tree storage engines, SlateDB writes data to object storage (S3, GCS, ABS, MinIO, Tigris, and so on). Leveraging object storage allows SlateDB to provide bottomless storage capacity, high durability, and easy replication. The trade-off is that object storage has a higher latency and higher API cost than local disk.

To mitigate high write API costs (PUTs), SlateDB batches writes. Rather than writing every put() call to object storage, MemTables are flushed periodically to object storage as a string-sorted table (SST). The flush interval is configurable.

To mitigate write latency, SlateDB provides an async put method. Clients that prefer strong durability can await on put until the MemTable is flushed to object storage (trading latency for durability). Clients that prefer lower latency can simply ignore the future returned by put.

To mitigate read latency and read API costs (GETs), SlateDB will use standard LSM-tree caching techniques: in-memory block caches, compression, bloom filters, and local SST disk caches.

Checkout slatedb.io to learn more.

Get Started

Add the following to your Cargo.toml:

[dependencies]
slatedb = "*"
bytes = "*"
object_store = "*"
tokio = "*"

Then you can use SlateDB in your Rust code:

use bytes::Bytes;
use slatedb::db::Db;
use slatedb::config::DbOptions;
use slatedb::object_store::{ObjectStore, memory::InMemory};
use std::sync::Arc;

#[tokio::main]
async fn main() {
    // Setup
    let object_store: Arc<dyn ObjectStore> = Arc::new(InMemory::new());
    let options = DbOptions::default();
    let kv_store = Db::open_with_opts(
        "/tmp/test_kv_store",
        options,
        object_store,
    )
    .await
    .unwrap();

    // Put
    let key = b"test_key";
    let value = b"test_value";
    kv_store.put(key, value).await;

    // Get
    assert_eq!(
        kv_store.get(key).await.unwrap(),
        Some(Bytes::from_static(value))
    );

    // Delete
    kv_store.delete(key).await;
    assert!(kv_store.get(key).await.unwrap().is_none());

    // Close
    kv_store.close().await.unwrap();
}

SlateDB uses the object_store crate to interact with object storage, and therefore supports any object storage that implements the ObjectStore trait. You can use the crate in your project to interact with any object storage that implements the ObjectStore trait. SlateDB also re-exports the object_store crate for your convenience.

Documentation

Visit slatedb.io to learn more.

Features

SlateDB is currently in the early stages of development. It is not yet ready for production use.

  • Basic API (get, put, delete)
  • SSTs on object storage
  • Range queries (#8)
  • Block cache (#15)
  • Disk cache (#9)
  • Compression (#10)
  • Bloom filters (#11)
  • Manifest persistence (#14)
  • Compaction (#7)
  • Transactions

License

SlateDB is licensed under the Apache License, Version 2.0.

Dependencies

~18–33MB
~508K SLoC