Skip to content

SYSTEM Cited by 3 sources

AWS ElastiCache

AWS ElastiCache is AWS's managed in-memory cache service, supporting the Redis and Memcached engines. Removes the operational burden of running Redis/Memcached clusters directly — patching, failover, read replicas, backups, multi-AZ — while keeping the engine semantics and client protocol the same. Typical uses: hot-path cache for application reads, fast serving tier for precomputed artifacts, rate-limit / counter storage, session storage.

Why it's mentioned here

Canva's Print Routing engine uses ElastiCache + Redis as the serving tier for per-region precomputed routing graphs:

  • 6 ms retrieval in most regions, 20 ms for the largest shards.
  • 99.999% availability (with read replicas).
  • Source of truth lives elsewhere (relational DB) — losing ElastiCache means rebuild, not data loss.

See sources/2024-12-10-canva-routing-print-orders and systems/canva-print-routing.

Seen in

  • sources/2024-12-10-canva-routing-print-orders — Canva Print Routing's graph serving tier.
  • sources/2026-04-21-figma-figcache-next-generation-data-caching-platformFigma FigCache fleet fronts a large pool of ElastiCache Redis clusters with varying isolation requirements, durability expectations, criticality characteristics, and traffic volumes. FigCache absorbs ElastiCache operational events (node failovers, cluster scaling activities, transient connectivity errors) as zero-downtime background operations — shard failovers run liberally and frequently across the entire footprint as live resiliency exercises. Canonical deployment shape: ElastiCache is the durable / ops-managed Redis; the in-house proxy tier is the unified data plane in front.
  • sources/2026-03-16-zalando-search-quality-assurance-with-ai-as-a-judgeevaluation-pipeline dedup-cache instance. Zalando's Search Quality Framework uses an ElastiCache instance scoped only to the offline LLM-as-judge evaluation tasks, caching both product-data fetches (keyed by product_id) and LLM-judge scores (keyed by (query, product) pair). Quantified cost collapse: "Instead of calling Product API (5000 × 25) times for 5000 search queries with 25 results, we only need to call it N times where N is the number of unique products in all search results." The cache is not shared with production catalog-search caches — evaluation must not pollute production hit-rate statistics or working sets. Canonical wiki instance of concepts/query-product-evaluation-cache and patterns/query-product-evaluation-cache; a contrasting shape from the Canva / Figma production- serving-tier uses.
Last updated · 542 distilled / 1,571 read