Skip to content

PATTERN Cited by 1 source

ReadPlan metadata cache

Pattern

Separate metadata resolution (path → storage address mapping) from data fetch; cache the resolved ReadPlan in a distributed memory store for sub-2 ms access on repeated reads.

Problem

Even with a unified metadata schema achieving O(1) lookups, hot BLOBs accessed concurrently by hundreds of GPUs create metadata hot-shard problems. Every concurrent reader hitting the same metadata path creates load concentration.

Solution

Cache the ReadPlan — the resolved mapping from object path to (blockId, offset, size) tuples — in a distributed-memory store (similar to memcache). Subsequent reads of the same BLOB skip the metadata lookup entirely and go straight to data fetch.

Result at Meta

  • 1–2 ms metadata access for cached ReadPlans
  • Solves the metadata hot-shard problem
  • Improves p50 and p99 latencies by serving from memory
  • Combined with the distributed data cache, absorbs traffic spikes from GPU restarts and checkpoint loading

(Source: sources/2026-07-01-meta-ai-storage-blueprint-at-scale)

Seen in

Last updated · 567 distilled / 1,685 read