CONCEPT Cited by 1 source
Tiered cache as planetary memory¶
Definition¶
Tiered cache as planetary memory is the architectural analogy that models a globally distributed storage system as a CPU memory hierarchy at planet scale — with local memory/flash as L1/L2, regional storage as L3, and global storage as "disk" — with transparent on-demand hydration between layers, just as an OS kernel hydrates data through page cache and CPU caches.
The Analogy (Meta's Framing)¶
What if we think of storage as a disk in a planet-scale computer and borrow ideas from the operating-system world?
- L1 cache: GPU host memory (distributed peer cache via Owl)
- L2 cache: GPU host flash (local SSD)
- L3 cache: Regional BLOB-storage fabric (flash-backed)
- Disk / Source of truth: Global BLOB-storage fabric (HDD-backed)
When a dataloader reads, data is hydrated on demand through these layers transparently — just as a Linux process reading from disk triggers transparent hydration through page cache → L2 → L1 CPU caches.
Enabling Mechanisms¶
- Dataloader prefetch: next batch prefetched into L1 during current batch processing
- Deep prefetch API: explicit
prefetch()triggers hydration from global → L3 + metadata cache warmup - Automatic data lifecycle: TTL and LRU eviction policies, capacity/quota-aware
Impact¶
Eliminated hours of explicit data ingestion time. Researchers ingest data once to global storage and access it from any region without thinking about regional boundaries.
(Source: sources/2026-07-01-meta-ai-storage-blueprint-at-scale, "Maximizing Research Velocity" section)
Seen in¶
- sources/2026-07-01-meta-ai-storage-blueprint-at-scale — canonical instance at Meta