CONCEPT Cited by 3 sources
Data Plane Atomicity¶
Definition¶
Data Plane Atomicity is Redpanda's design tenet for BYOC cloud deployments: no deployment can bring down any other deployment, including a control-plane outage, and the data path at every customer runs with zero runtime dependency on any externalised service (no external consensus, secret managers, databases, offset-recording services, or metadata lookups).
Canonical statement on the wiki¶
The founder-voice canonical statement (Source: Gallego 2025-04-03):
"Six years ago, we spent a year writing the first version of our cloud, then threw it away and wrote it again, which led to the birth of what we call BYOC. At the time, this decision was controversial because we were not charging incremental dollars to resell Amazon infrastructure, simply the value of the software we delivered."
"The goal was to have a single deployment model that we could build on top for years to come that would be compatible with the strictest security and compliance requirements. The main concept we came up with was Data Plane Atomicity. The premise was that no deployment should be able to bring down any other deployment, including a control plane outage."
"All failures on all data planes are independent for the data path while still providing a fully managed service. No externalized consensus algorithm, secret managers, no external databases, no external offset recording service, or metadata look up as you are trying to write your data durably to disk. At worst, you wouldn't be able to click through an upgrade process, but your data path would always be up — hooray! (And without getting paged at 3 AM.)"
Load-bearing tenets¶
The definition enumerates five specific "no runtime dependencies":
- No externalized consensus algorithm. The data-plane cluster runs its own per-partition Raft groups (see systems/redpanda); no outside consensus service in the write path.
- No external secret managers. Secrets needed for the data path are locally materialised.
- No external databases. No managed-DB lookup as a prerequisite to durable write.
- No external offset-recording service. Offsets live in the cluster itself; there is no metadata service whose outage would freeze the producer.
- No external metadata lookup as you are trying to write. The data path is self-describing.
Failure mode framing is sharp: on a control-plane outage you lose the ability to click through an upgrade process, but everything else — produce, consume, replicate, elect a new leader, serve metadata to clients — continues uninterrupted on the customer's own infrastructure.
Why atomicity matters more in BYOC than in SaaS¶
In a classic multi-tenant SaaS, a control-plane outage can be rate-limited in blast radius by sharding; some customers degrade, others don't. In BYOC every customer already owns the data plane (deployed in their own VPC) — so the control-plane boundary is sharper: the customer expects the control plane to be a convenience layer, not a dependency in their hot path. "Our insight was that since the dataplane is already deployed within the customer's network, it is used to connect the internal applications already with Redpanda Connect's portfolio."
This is why Data Plane Atomicity is load-bearing: it's the design contract that lets Redpanda market BYOC as "a fully managed service" while the customer's data never leaves their VPC and the data path never blocks on Redpanda-owned infrastructure.
Relationship to existing wiki concepts¶
- concepts/control-plane-data-plane-separation — Data Plane Atomicity is an extreme instantiation of CP/DP separation. The standard pattern says control plane outages should not stop the data plane; Data Plane Atomicity goes further: no runtime dependency on the control plane at all.
- concepts/static-stability — the AWS-canon name for the same shape: the data plane works correctly when its dependencies are at their last-known-good state. Data Plane Atomicity is a stricter condition than static stability: it demands the data path make no runtime calls to dependencies at all, rather than just tolerating dependency staleness.
- concepts/isolation-as-fault-tolerance-principle — the PlanetScale-canon framing: small isolated units fail independently. Data Plane Atomicity applies this at the tenant / deployment cardinality.
- concepts/control-plane-impact-without-data-plane-impact — the necessary consequence: a bad control-plane push affects manageability but not availability.
- concepts/managed-data-plane — the BYOC-model superset; atomicity is the correctness-invariant a managed-data-plane design must satisfy.
- concepts/cell-based-architecture — the deployment shape that operationalises the Data Plane Atomicity invariant. Where Data Plane Atomicity says "no runtime dependencies", cell-based architecture says "deploy as independent cells so the no-dependencies property holds per-cell and no cross-cell dependencies can form."
Contrast with the "rent-markup" cloud model¶
Gallego frames the decision as controversial at the time: "we were not charging incremental dollars to resell Amazon infrastructure, simply the value of the software we delivered." The implicit contrast is with Confluent Cloud / managed-Kafka-SaaS products, which run the data plane in the vendor's account and charge infrastructure markup. BYOC + Data Plane Atomicity sidesteps both the commercial model (no AWS-rent markup) and the architectural coupling (no cross-tenant blast radius).
Caveats¶
- Not a benchmark — the invariant is asserted by the vendor; no external third-party verification is cited in the source post.
- Upgrade UX degrades under control-plane outage — the essay is explicit: you "wouldn't be able to click through an upgrade process"; the promise is availability, not full manageability.
- BYOC deployment model specifically — the atomicity claim is made for BYOC; Redpanda's Serverless / Dedicated SaaS variants are not necessarily atomic in the same sense.
- Mechanism detail is deferred. The source post points at a 2021 BYOC Data Plane Atomicity post (not yet in the wiki corpus) for the implementation-level disclosure; the 2025-04-03 post is the founder's retrospective framing, not the mechanism-depth disclosure.
Seen in¶
-
sources/2025-06-20-redpanda-behind-the-scenes-redpanda-clouds-response-to-the-gcp-outage — 2025-06-20 retrospective on the 2025-06-12 GCP outage. Canonicalises cell-based architecture as the deployment shape that operationalises Data Plane Atomicity — the post is explicit: "We have taken this pattern further and made it a product principle" (linking to the 2021 BYOC Data Plane Atomicity post). Data Plane Atomicity is the invariant; cell-based architecture is the deployment shape that makes the invariant achievable. The GCP outage was the real-world validation: hundreds of Redpanda Cloud customer clusters were structurally isolated from each other and from any shared Redpanda-operated services, so the upstream cloud outage had no blast-radius path into the customer fleet.
-
sources/2025-04-03-redpanda-autonomy-is-the-future-of-infrastructure — Gallego's founder-voice canonical statement of the tenet as BYOC's central design invariant, framed as the prerequisite for Redpanda as agent-era infrastructure (data must be durably writable on the customer's own infrastructure with no vendor runtime dependency).
-
sources/2025-10-28-redpanda-governed-autonomy-the-path-to-enterprise-agentic-ai — 2025-10-28 ADP governance-framing post carries the tenet forward to the agent-infrastructure altitude. The ADP VPC/BYOC deployment axis inherits Data Plane Atomicity — agents can operate on private data inside the customer VPC without vendor control-plane runtime dependencies. Substrate for digital sovereignty at agent altitude.
Related¶
- systems/redpanda-byoc
- systems/redpanda
- concepts/control-plane-data-plane-separation
- concepts/static-stability
- concepts/isolation-as-fault-tolerance-principle
- concepts/control-plane-impact-without-data-plane-impact
- concepts/managed-data-plane
- concepts/digital-sovereignty
- concepts/model-to-data-vs-data-to-model