SYSTEM Cited by 1 source
Netflix Data Bridge¶
Stub — minimal viable page¶
Netflix Data Bridge is Netflix's internal data-movement substrate, mentioned in the 2026-06-03 dynamic-partition-splitting post (Source: sources/2026-06-03-netflix-dynamically-splitting-wide-partitions-in-cassandra-for-time-series-workloads) as the platform that runs offline Spark verification jobs to confirm that dynamically-split partitions are an exact match to the original. The TimeSeries team uses Data Bridge as the second layer of correctness validation alongside online pre/post-checksum gates (concepts/checksum-validated-data-migration).
This page is a stub — Netflix has a deep blog post on Data Bridge (Data Bridge: How Netflix Simplifies Data Movement, linked from the 2026-06-03 post but not summarised in this ingest) which a future ingest can deepen.
Role in the wiki so far¶
- Offline correctness substrate for dynamic partition splitting — runs Spark jobs that verify split data against original, beyond online pre/post-checksum gates.
- Implied: a general data-movement substrate at Netflix for ETL, replication, multi-store synchronisation, and offline verification.
Seen in¶
- sources/2026-06-03-netflix-dynamically-splitting-wide-partitions-in-cassandra-for-time-series-workloads — Named as the substrate for defence-in-depth offline verification of dynamic-partition-splits. "Using our existing Data Bridge pipelines to verify splits offline … Spark job to ensure that the split data is an exact match to the original data."
Related¶
- systems/netflix-timeseries-abstraction — the consumer of Data Bridge for split verification.
- systems/apache-cassandra — the source/target store for verified data.
- systems/apache-spark — the verification engine running on Data Bridge.
- concepts/checksum-validated-data-migration — the online primary check that Data Bridge complements.
- patterns/dynamic-partition-split-async-pipeline — the pipeline Data Bridge gives defence-in-depth to.