Skip to content

CONCEPT Cited by 3 sources

Scale to Zero

Scale-to-zero is a service-design property in which an application consumes no capacity and accrues no charge when it has no traffic, yet can serve the next request without customer-initiated capacity actions.

Design commitment

Lambda elevated scale-to-zero from an implementation detail to a stated tenet in its 2014 PR/FAQ: "Our service will scale customer applications without changes to their code or configuration. We will architect such that one application invocation per month and 1,000 per second are both well supported."

Operationally: "Lambda imposes no warm-up or cool-down periods or charges… Infrequent or periodic jobs are cost effective, sharing capacity with other users and only charging for actual execution time."

(Source: sources/2024-11-15-allthingsdistributed-aws-lambda-prfaq-after-10-years)

What this requires the provider to own

  • Multi-tenant placement. Idle customers can't keep hosts warm; the provider has to pack active work onto shared capacity. See systems/firecracker for Lambda's isolation primitive.
  • Fast cold start. The wait on the first request is the price of zero idle cost. See concepts/cold-start and SnapStart for how Lambda has attacked this over the years.
  • Fine-grained billing granularity. Without per-ms billing, a "scales to zero" service would still charge customers for partial execution quanta. See concepts/fine-grained-billing.
  • Placement engine that compacts per-account workloads. From the PR/FAQ: "Each new request is placed with respect to minimizing the number of instances dedicated to that account… Spiky workloads, heterogeneous workloads, and short-lived jobs such as cron or batch applications all use capacity efficiently."

Scale-to-zero is the direct counterpart to provisioned / always-warm compute. The two are often offered together (Lambda Provisioned Concurrency, container always-on, etc.) so users can choose per-workload: zero cold starts vs. zero idle cost.

Seen in

Last updated · 200 distilled / 1,178 read