SYSTEM Cited by 3 sources
Delta Lake
Delta Lake is an open-source concepts/open-table-format built over systems/apache-parquet on object storage. It is one of the three canonical OTFs alongside systems/apache-iceberg and Apache Hudi, and the table format native to Databricks' Data Intelligence Platform.
Minimum viable framing for this wiki: it plays the same architectural role as Iceberg — ACID transactions, schema evolution, time-travel, snapshot-based metadata over immutable columnar files. See concepts/open-table-format for the shared shape and the gap the format class fills above concepts/immutable-object-storage.
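As a rough illustration of "snapshot-based metadata over immutable columnar files", the sketch below models a table as an append-only list of commits, each recording the data files it adds and removes. The names `Commit`, `ToyTableLog`, and the file names are hypothetical and exist only for this example. It is a toy model of the general idea, not Delta Lake's actual log format or API.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Commit:
    """One atomic table version: data files added and removed (toy model)."""
    added: frozenset = frozenset()
    removed: frozenset = frozenset()


@dataclass
class ToyTableLog:
    """Hypothetical stand-in for a table format's ordered commit log."""
    commits: list = field(default_factory=list)

    def commit(self, added=(), removed=()):
        # Appending a commit is the only way the table changes; data files
        # themselves are never modified in place.
        self.commits.append(Commit(frozenset(added), frozenset(removed)))
        return len(self.commits) - 1  # version number of the new commit

    def snapshot(self, version=None):
        # Replay commits 0..version to find the files live at that version.
        if version is None:
            version = len(self.commits) - 1
        live = set()
        for c in self.commits[: version + 1]:
            live -= c.removed
            live |= c.added
        return live


log = ToyTableLog()
v0 = log.commit(added={"part-0.parquet", "part-1.parquet"})
# Rewriting part-1 (say, to delete a row) adds a new file and retires the old one.
v1 = log.commit(added={"part-2.parquet"}, removed={"part-1.parquet"})

print(sorted(log.snapshot(v0)))  # time travel: files live at version 0
print(sorted(log.snapshot()))    # files live at the latest version
```

Because old files stay on storage after a commit retires them, reading an earlier version is just a replay to that point. That is also why physically erasing data needs a separate clean-up step.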
Features cited in ingested sources
- Deep Clone. Incrementally materialises a snapshot of a source table (typically another Delta table, or a Delta Sharing share) as a new, physically separate Delta table in the clone's object store. Subsequent deep-clone runs transfer only the changes since the previous clone. This is the replication primitive that Mercedes-Benz's cross-cloud Sync Jobs use, and it is what makes patterns/cross-cloud-replica-cache economically viable at 60 TB.
- VACUUM. Deletes data files referenced only by old snapshots once the retention period has passed. Mercedes-Benz relies on VACUUM on the replicated Delta tables to enforce GDPR right-to-be-forgotten: deletions on the source propagate through the next Deep Clone sync, and the old files are then vacuumed out of ADLS on the recipient side.
(Source: sources/2026-04-20-databricks-mercedes-benz-cross-cloud-data-mesh)
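The interaction between the two features above can be sketched as a toy simulation: an incremental sync that copies only files the replica has not yet received, followed by a vacuum that physically deletes files the replica's current snapshot no longer references. The function names, file names, and the idea of tracking file age in a dictionary are assumptions made for illustration. This is not how Delta Lake implements either operation, and the source does not describe the exact job configuration Mercedes-Benz uses.

```python
def deep_clone_sync(source_live, replica_store):
    """Toy incremental sync: copy only files the replica has never received.

    source_live   -- set of file names in the source's current snapshot
    replica_store -- dict of file name -> bytes physically held by the replica
    Returns (new replica snapshot, sorted list of files copied in this run).
    """
    copied = sorted(f for f in source_live if f not in replica_store)
    for f in copied:
        replica_store[f] = b"..."  # placeholder for the copied Parquet bytes
    # The replica's new snapshot mirrors the source; files dropped at the
    # source stop being referenced but still exist physically.
    return set(source_live), copied


def vacuum(store, live, file_age_hours, retain_hours=168):
    """Toy VACUUM: delete unreferenced files at least retain_hours old.

    168 hours mirrors Delta Lake's default 7-day retention; the age
    bookkeeping itself is a simplification for this sketch.
    """
    doomed = sorted(
        f for f in store
        if f not in live and file_age_hours.get(f, 0) >= retain_hours
    )
    for f in doomed:
        del store[f]
    return doomed


replica_store = {}
# First sync copies everything.
replica_live, first_copy = deep_clone_sync({"a.parquet", "b.parquet"}, replica_store)
print(first_copy)  # ['a.parquet', 'b.parquet']

# The source rewrites b.parquet as c.parquet to erase one person's rows.
replica_live, second_copy = deep_clone_sync({"a.parquet", "c.parquet"}, replica_store)
print(second_copy)  # ['c.parquet']

# b.parquet is unreferenced but stays on disk until vacuum runs past retention.
ages = {"a.parquet": 400, "b.parquet": 400, "c.parquet": 1}
vacuumed = vacuum(replica_store, replica_live, ages)
print(vacuumed)  # ['b.parquet']
```

On Databricks the corresponding SQL statements take the form `CREATE OR REPLACE TABLE <target> DEEP CLONE <source>` and `VACUUM <table> RETAIN <n> HOURS`. The point of the sketch is the ordering: the sync removes the reference, and only the later vacuum removes the bytes.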
Seen in
- sources/2026-04-20-databricks-mercedes-benz-cross-cloud-data-mesh: Delta format for the local replica tier; Deep Clone as the incremental-sync primitive; VACUUM as the GDPR-compliance hook on the replica side.
- sources/2024-07-29-aws-amazons-exabyte-scale-migration-from-apache-spark-to-ray-on-ec2: named as one of the three open table formats Amazon BDT's Ray compactor (systems/deltacat) is being extended to support, alongside systems/apache-iceberg and systems/apache-hudi. The BDT in-house copy-on-write compactor predates these OTFs and gave its design back to DeltaCAT as the Flash Compactor.
- sources/2026-04-17-databricks-governing-coding-agent-sprawl-with-unity-ai-gateway: Delta as a telemetry destination. Unity AI Gateway automatically lands coding-agent OpenTelemetry metrics and traces in Unity-Catalog-managed Delta tables, making AI-tool telemetry a first-class Lakehouse dataset joinable with HR, PR-velocity, and capacity-planning data. See patterns/telemetry-to-lakehouse.
Related
- systems/apache-iceberg — peer open table format; Mercedes-Benz stored source data in Iceberg on AWS Glue, then federated+shared it as Delta via UC + Delta Sharing.
- systems/delta-sharing — wire-level exchange protocol for Delta tables across metastores / clouds / partners.
- systems/unity-catalog — governance / catalog plane that owns the Delta (and federated Iceberg) tables end-to-end.
- concepts/open-table-format — the format class.