ParityDb is an embedded persistent key-value store designed by Parity Technologies and optimized for blockchain applications. It focuses on efficient storage and retrieval of blockchain state data, making it well suited to high-performance blockchain workloads.
ParityDb is specifically designed to handle the unique demands of blockchain applications.
ParityDb provides a universal key-value store that supports transactions. Data can be partitioned into columns, each containing entries of a single data type, such as state trie nodes, block headers, or blockchain transactions. Two types of column indexes are supported: Hash and Btree.
The database supports multiple concurrent readers, while writes are serialized and performed in batches (transactions). Transactions are applied atomically, ensuring all data in a transaction is either written or not written at all.
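The commit model described above can be illustrated with a minimal, std-only sketch. This is not ParityDb's real API (the names `Db`, `commit`, and `get` are illustrative): it only shows the shape of the guarantee, namely that a batch of changes across columns becomes visible all at once, while concurrent readers see a consistent snapshot.

```rust
use std::collections::BTreeMap;
use std::sync::RwLock;

// Illustrative types: a column id, and a change that either sets a key
// (Some(value)) or deletes it (None).
type ColumnId = u8;
type Change = (ColumnId, Vec<u8>, Option<Vec<u8>>);

struct Db {
    // RwLock allows many concurrent readers; writes are serialized.
    columns: RwLock<BTreeMap<(ColumnId, Vec<u8>), Vec<u8>>>,
}

impl Db {
    fn new() -> Self {
        Db { columns: RwLock::new(BTreeMap::new()) }
    }

    // All changes in the transaction become visible together, or not at all:
    // the write lock is held for the whole batch.
    fn commit(&self, tx: Vec<Change>) {
        let mut cols = self.columns.write().unwrap();
        for (col, key, value) in tx {
            match value {
                Some(v) => { cols.insert((col, key), v); }
                None => { cols.remove(&(col, key)); }
            }
        }
    }

    fn get(&self, col: ColumnId, key: &[u8]) -> Option<Vec<u8>> {
        self.columns.read().unwrap().get(&(col, key.to_vec())).cloned()
    }
}
```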
ParityDb relies on the OS page cache instead of implementing custom data caching, making the performance of a large database dependent on available system memory.
The database ensures consistency even if IO is interrupted, although the latest state before the interruption might not be restored. Committed writes not yet saved to disk by background threads are lost.
Each column stores data in a set of 256 value tables. The first 255 tables contain entries of certain size ranges up to a 32 kB limit, while the 256th table stores entries over 32 kB, split into multiple parts. Hash columns also include a hash index file.
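To make the table layout concrete, here is a hedged sketch of how a value might be routed to one of the 256 tables by size class. ParityDb's actual size ranges are not specified in this text; the even spacing below is an assumption purely for illustration, with the last table reserved for oversized (multipart) values.

```rust
// Values over 32 kB go to the last (multipart) table and are split into parts.
const MULTIPART_TABLE: u8 = 255;
const SIZE_LIMIT: usize = 32 * 1024;

// Pick a value table for an entry of `len` bytes. The even spacing of the
// 255 fixed-size classes is an assumption for illustration only.
fn table_for_size(len: usize) -> u8 {
    if len > SIZE_LIMIT {
        MULTIPART_TABLE
    } else {
        (len.saturating_sub(1) * 255 / SIZE_LIMIT) as u8
    }
}
```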
A metadata file defines the database structure, including a set of columns with specific configurations.
The hash index is a mmap-backed, dynamically sized probing hash table. The first n bits of each key's 256-bit hash k select one of 2^n 512-byte index pages. Each page contains an unordered list of 64 8-byte entries. If an insertion is attempted into a full index page, a reindex is triggered.
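The page-addressing arithmetic follows directly from the numbers above (2^n pages of 512 bytes, each holding 64 eight-byte entries). A small sketch, assuming the hash prefix is given as its first 8 bytes in big-endian order and 1 <= n <= 64:

```rust
const PAGE_SIZE: u64 = 512;
// 64 entries of 8 bytes each fill one 512-byte page exactly.
const ENTRIES_PER_PAGE: u64 = 64;
const ENTRY_SIZE: u64 = 8;

// Select the index page from the first n bits of the key hash.
// `hash_prefix` is the first 8 bytes of the 256-bit hash, big-endian;
// assumes 1 <= n <= 64.
fn page_for_hash(hash_prefix: [u8; 8], n: u32) -> u64 {
    let top = u64::from_be_bytes(hash_prefix);
    top >> (64 - n)
}

// Byte offset of a page within the index file.
fn page_offset(page: u64) -> u64 {
    debug_assert_eq!(ENTRIES_PER_PAGE * ENTRY_SIZE, PAGE_SIZE);
    page * PAGE_SIZE
}
```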
Value tables are linear arrays of fixed-size entries. Each entry can contain: the part of the key hash k not covered by the index, a 15-bit data value size, a compression flag, an optional reference counter, and the actual value.

Lookup works as follows: compute k, find the index page using the first n bits, search the page for an entry with matching key bits, and use the address in that entry to query the partial k and value from the value table.
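The lookup steps can be modeled in memory. This is a hypothetical simplification (entry widths and field names are assumptions, not ParityDb's on-disk format): probe one index page for entries whose stored key bits match, then read the addressed value-table slot and confirm the rest of the key hash, since several entries may collide on the same key bits.

```rust
#[derive(Clone, Copy)]
struct IndexEntry {
    key_bits: u16,  // partial key bits stored in the index (width assumed)
    address: usize, // slot in the value table
}

struct ValueSlot {
    partial_k: Vec<u8>, // remainder of the key hash, stored with the value
    value: Vec<u8>,
}

fn lookup(
    page: &[IndexEntry],
    key_bits: u16,
    partial_k: &[u8],
    table: &[ValueSlot],
) -> Option<Vec<u8>> {
    // The page's entries are unordered, so scan them all; the final
    // comparison against the full partial hash resolves collisions.
    page.iter()
        .filter(|e| e.key_bits == key_bits)
        .find_map(|e| {
            let slot = table.get(e.address)?;
            (slot.partial_k == partial_k).then(|| slot.value.clone())
        })
}
```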
Reindexing creates a new index table with twice the capacity and starts moving entries from the old table to the new one; queries check both tables while the reindex is in progress.
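The incremental reindex can be sketched with two plain maps standing in for the mmap-backed tables (the types and method names are illustrative, not ParityDb's): lookups consult the new table first and fall back to the old one until migration finishes.

```rust
use std::collections::HashMap;

// Sketch of an in-progress reindex: entries migrate from `old` to `new`
// (which has twice the capacity in the real database).
struct Reindexing {
    old: HashMap<Vec<u8>, u64>,
    new: HashMap<Vec<u8>, u64>,
}

impl Reindexing {
    // Check the new table first, falling back to the old one for entries
    // that have not been moved yet.
    fn get(&self, key: &[u8]) -> Option<u64> {
        self.new.get(key).or_else(|| self.old.get(key)).copied()
    }

    // Move one entry; reindexing is complete when `old` is empty.
    fn migrate_one(&mut self) -> bool {
        let key = self.old.keys().next().cloned();
        match key {
            Some(key) => {
                let addr = self.old.remove(&key).unwrap();
                self.new.insert(key, addr);
                true
            }
            None => false,
        }
    }
}
```

Lookups return the same results before, during, and after the migration, which is the property the two-table check exists to preserve.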
On commit, all data is moved to an in-memory overlay, making it available for queries, then added to the commit queue. The commit worker processes the queue, writes modified data to a binary log file, and flushes it to disk. A finalization thread reads the log file, applies changes to the tables, and clears the overlay.
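The overlay's role in this pipeline can be sketched as follows, with the log and finalization threads collapsed into a single `finalize` step (a simplification; in the real database those run asynchronously in the background):

```rust
use std::collections::HashMap;

// Sketch of the commit pipeline: committed data lands in an in-memory
// overlay, immediately visible to queries, before background work writes
// it to the tables and clears it from the overlay.
struct CommitPipeline {
    overlay: HashMap<Vec<u8>, Vec<u8>>,
    tables: HashMap<Vec<u8>, Vec<u8>>, // stands in for the on-disk tables
}

impl CommitPipeline {
    fn commit(&mut self, changes: Vec<(Vec<u8>, Vec<u8>)>) {
        for (k, v) in changes {
            self.overlay.insert(k, v);
        }
    }

    // Queries see the overlay first, so data is readable right after commit.
    fn get(&self, key: &[u8]) -> Option<&Vec<u8>> {
        self.overlay.get(key).or_else(|| self.tables.get(key))
    }

    // What the log and finalization threads accomplish, in one step:
    // apply overlay contents to the tables and clear the overlay.
    fn finalize(&mut self) {
        for (k, v) in self.overlay.drain() {
            self.tables.insert(k, v);
        }
    }
}
```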
On startup, existing log files are validated and applied to restore the database state.