PATTERN Cited by 1 source
ReadPlan metadata cache¶
Pattern¶
Separate metadata resolution (path → storage address mapping) from data fetch; cache the resolved ReadPlan in a distributed memory store for sub-2 ms access on repeated reads.
Problem¶
Even with a unified metadata schema achieving O(1) lookups, hot BLOBs accessed concurrently by hundreds of GPUs create metadata hot-shard problems. Every concurrent reader hitting the same metadata path creates load concentration.
Solution¶
Cache the ReadPlan — the resolved mapping from object path to (blockId, offset, size) tuples — in a distributed-memory store (similar to memcache). Subsequent reads of the same BLOB skip the metadata lookup entirely and go straight to data fetch.
Result at Meta¶
- 1–2 ms metadata access for cached ReadPlans
- Solves the metadata hot-shard problem
- Improves p50 and p99 latencies by serving from memory
- Combined with the distributed data cache, absorbs traffic spikes from GPU restarts and checkpoint loading
(Source: sources/2026-07-01-meta-ai-storage-blueprint-at-scale)
Seen in¶
- sources/2026-07-01-meta-ai-storage-blueprint-at-scale — Meta AI storage spikes and hot spots