PATTERN Cited by 1 source

Metastore bootstrap from object storage¶

Pattern¶

Design the metadata store so its complete state (manifest + SSTables) exists in object storage at all times. This enables two key use cases without connecting to the source cluster:

Disaster recovery (full cluster rebuild): Reconstruct the metadata store entirely from object storage, then resume operations from the last-flushed state.
Read replicas: A remote cluster bootstraps by downloading the manifest, then lazily pulls SSTables on demand to serve consumer queries.

When to use¶

The system must support recovery from complete cluster loss.
Read-only replicas in different regions should not add load to the source cluster.
The RPO guarantee (bounded by flush interval) is acceptable.

Mechanism¶

Every flush writes SSTables + manifest to object storage (the source of truth for external consumers).
DR path: New cluster downloads manifest → knows all SSTables → rebuilds internal metadata topic → determines last-written offset per partition → resumes writes.
Read-replica path: Remote cluster downloads manifest → pulls SSTables on demand → serves Kafka consumers against them.

Trade-offs¶

Pro: No cross-cluster network links needed for DR or read replicas.
Pro: Object storage provides the durability guarantee.
Con: Staleness bounded by flush interval (default 10 min for Redpanda's metastore).
Con: Bootstrap time depends on manifest/SSTable download speed.

(Source: sources/2026-06-09-redpanda-cloud-topics-the-metastore)

Seen in¶

systems/redpanda-cloud-topics-metastore — Whole Cluster Restore and read-replica topics bootstrap from object-storage metastore state