PATTERN Cited by 1 source
Middle-tier storage media¶
Context¶
Storage architectures at hyperscale settle into media tiers — HDD below, flash above. Over time, each tier's economics shift: HDD areal-density climbs while its BW/TB falls (head seeks are flat while capacity grows); flash gets denser + cheaper but remains expensive relative to HDD.
Workloads that previously fit HDD's BW/TB band can stop fitting when HDD density crosses a threshold. They become stranded on HDD — technically on the drive, but suffering IOPS-per-byte starvation. Promoting them to the flash tier costs more than the workload justifies. Overprovisioning HDDs (buying more drives for IOPS) defeats the capacity/$ rationale of HDD.
A gap opens between the top of the HDD tier and the bottom of the flash tier.
The pattern¶
Insert a new media tier whose cost/performance/endurance profile sits between the two incumbents. Drive-level density lower than top-tier flash but substantially higher than HDD; BW/TB higher than HDD but lower than top-tier flash; cost/byte between the two.
Discipline:
- Identify the stranded-workload band. What BW/TB range is underserved? Meta 2025: ~10-20 MB/s/TB, the band where 16-20 TB HDDs were adequate and where large-batch-IO workloads currently overpay for TLC.
- Name the media that fills it. Often a denser-bit-cell variant (QLC here) that density-scaling has made economical.
- Validate the workload-endurance match. The new media's endurance floor must be met by the target workload's write profile with headroom. Meta explicitly matches QLC to read-BW-intensive + low-write workloads.
- Compose form factor + software stack. New media often exposes new asymmetries (R/W), new constraints (package count), new interfaces (userspace FTL). Each needs its own design.
- Co-design with a vendor partner willing to move at your pace. Meta + Pure Storage (DFM) is the canonical 2025 instance; Meta + NAND vendors for the standard-NVMe path.
- Accept hybrid-cost honesty up front. The new tier is usually not yet cost-competitive with the lower tier; early deployments are justified by power efficiency, density, and the cost of not solving the stranded-workload problem — not by total cost per byte.
Canonical instance: Meta QLC 2025¶
Meta's 2025-03-04 post introduces QLC flash between HDD and TLC flash:
| Tier | BW/TB | Role |
|---|---|---|
| HDD | ~5-10 MB/s/TB (falling) | Cold bulk |
| QLC | 10-20 MB/s/TB | Batch IO / read-BW-intensive (new) |
| TLC | 50+ MB/s/TB | Mixed / write-heavy |
Meta's density target: 6× the densest TLC-based server today, with individual QLC drives scaling to 512 TB (U.2-15mm standard) or 600 TB (Pure Storage DFM).
When to apply¶
- Two-tier structure under stress: bottom tier's BW/TB has fallen below workload requirements; top tier is overkill + overpaid.
- New media available that covers the gap at a materially different cost / power / density profile.
- Target workloads exist whose shape (here: read-BW-intensive + low-write) matches the new media's strengths.
- Willingness to invest in the stack transformations — form factor, software, operations, migration.
When NOT to apply¶
- Workload volume too small for a new tier's amortised operational cost to pay off.
- New media's differentiation is marginal (e.g., second-source of existing tier, not a new band).
- Workload shape doesn't match the new media's endurance/asymmetry profile — forcing mixed workloads onto QLC would burn endurance and force the rate controller into degenerate regimes.
Trade-offs¶
- Operational complexity grows — three tiers means three form factors, three software paths, three hardware-vendor relationships.
- Migration takes years at hyperscale. You plan for multi-year phased adoption.
- Cost-per-byte parity not required at launch — power savings, density, and avoided-stranding can justify the deployment even at a cost premium. Meta is explicit.
Adjacent patterns¶
- patterns/tiered-storage-to-object-store — tier the location of data (local vs object store). Orthogonal axis.
- patterns/co-design-with-ocp-partners — how you build out the new tier's partner roster.
Seen in¶
- sources/2025-03-04-meta-a-case-for-qlc-ssds-in-the-data-center — canonical Meta instance of inserting a QLC tier between HDD and TLC.
Related¶
- concepts/bandwidth-per-terabyte / concepts/storage-media-tiering — the framing vocabulary.
- concepts/write-endurance-nand / concepts/hard-drive-physics — the properties that define where tier boundaries fall.
- systems/qlc-flash / systems/tlc-flash — the canonical tier members.
- companies/meta.