CONCEPT Cited by 2 sources

DDSketch error-bounded percentile

A DDSketch (arXiv:1908.10693) is a streaming quantile sketch that reports percentiles with a guaranteed relative-error bound over a stream of numeric values, without retaining the raw values. Reported p50 / p95 / p99 are within a bounded relative error of the true quantiles, with a memory footprint logarithmic in the value range.

(Source: sources/2026-04-21-planetscale-query-performance-analysis-with-insights.)

Why sketches for percentiles

Observability systems need per-pattern / per-endpoint / per-tenant percentiles, potentially across millions of series. Two naive approaches fail at scale:

  • Retain every value and sort at query time — cost is O(N) memory per series, infeasible when N reaches billions.
  • Sample (keep 1 in K) — biases tail quantiles exactly where precision matters most (concepts/tail-latency-at-scale).

Quantile sketches give a third option: bounded memory, bounded relative error, mergeable across shards / workers / time buckets.
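To make "bounded memory" concrete, a back-of-envelope bucket count (the α = 0.01 error bound and the 1 µs–100 s latency range — eight orders of magnitude — are illustrative assumptions, not figures from the source):

```python
import math

# Assumed parameters: relative-error bound alpha = 0.01 and a value
# range spanning 8 orders of magnitude (e.g. 1 microsecond to 100 s).
alpha = 0.01
gamma = (1 + alpha) / (1 - alpha)            # bucket growth factor
buckets = math.ceil(math.log(1e8) / math.log(gamma))
print(buckets)                               # roughly 900 buckets
```

Around nine hundred small counters per series, versus the billions of raw values they summarize.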

DDSketch mechanism (sketch)

DDSketch buckets values on a logarithmic scale — bucket i covers [γ^i, γ^(i+1)) where γ = (1 + α)/(1 − α) and α is the relative-error bound. Inserting a value increments the corresponding bucket count. A quantile query scans buckets in order to find the one containing the target rank; the returned value is within α relative error of the true quantile by construction.
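The mechanism above can be sketched in a few lines (a minimal illustration of the log-bucketing and rank scan under the [γ^i, γ^(i+1)) indexing convention; the paper's full variant adds bucket collapsing and zero/negative handling, omitted here):

```python
import math

class DDSketch:
    """Minimal DDSketch over positive values (illustrative only)."""

    def __init__(self, alpha=0.01):
        self.alpha = alpha
        self.gamma = (1 + alpha) / (1 - alpha)   # bucket growth factor
        self.buckets = {}                        # bucket index -> count
        self.count = 0

    def insert(self, value):
        # bucket i covers [gamma^i, gamma^(i+1))
        i = math.floor(math.log(value, self.gamma))
        self.buckets[i] = self.buckets.get(i, 0) + 1
        self.count += 1

    def quantile(self, q):
        # walk buckets in index order until the cumulative count
        # reaches the target rank
        rank = q * (self.count - 1)
        seen = 0
        for i in sorted(self.buckets):
            seen += self.buckets[i]
            if seen > rank:
                # bucket "midpoint" estimate: for any x in
                # [gamma^i, gamma^(i+1)), this is within alpha
                # relative error of x
                return 2 * self.gamma ** (i + 1) / (self.gamma + 1)
        raise ValueError("empty sketch")
```

Feeding in latencies 1..10000, `quantile(0.99)` comes back within about 1% of the true p99 — the same precision `quantile(0.5)` gets for the median.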

Properties:

  • Mergeable: bucket-counts sum, so sketches from different shards / intervals can be combined without loss of error bounds.
  • Compact: bucket count grows logarithmically in the value range.
  • Rank-preserving for tails: the relative-error bound means p99 / p99.9 are reported with the same precision as p50, which is the core property fixed-bucket histograms fail to deliver.
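The merge property reduces to summing bucket maps — each bucket still covers the same value range, so the error bound is unaffected (bucket indexes and counts below are made up for illustration):

```python
from collections import Counter

# Two shards' sketches, represented as bucket-index -> count maps.
shard_a = Counter({3: 10, 4: 2})
shard_b = Counter({3: 5, 5: 1})

# Merging is plain count addition; no raw values needed.
merged = shard_a + shard_b
# merged == Counter({3: 15, 4: 2, 5: 1})
```

This is what lets per-shard or per-interval sketches roll up into a single percentile at query time.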

Use in PlanetScale Insights

Rafer Hazen, 2023-04-20: "We also send along a sketch of query execution times that allows us to show error-bounded percentiles (e.g., median and p95)." The sketch is emitted per query-pattern fingerprint per interval, merged in the aggregation pipeline (Kafka → ClickHouse per the enhanced-tagging post), and queried at UI time.

This is the structural answer to "how do you give every query pattern a p95 without storing every execution?" on a platform that handles "thousands or even millions of queries per second".

Seen in

Last updated · 470 distilled / 1,213 read