CONCEPT Cited by 1 source

Adaptive rate limiting¶

Definition¶

Adaptive rate limiting is a rate-control strategy where the limit (tokens/second, requests/second, or items/second) is periodically recomputed based on current system state — typically workload size, downstream capacity, or observed error rates — rather than being a static configuration value.

Motivation¶

Static rate limits break in two directions as systems scale: - Too low → SLA violations as population outgrows the fixed budget. - Too high → downstream overload when population shrinks or downstream degrades.

Adaptive limits track the actual need and adjust automatically.

Variants¶

Population-proportional — rate = f(current_item_count, target_interval). Recomputed periodically. (Cloudflare Security Insights scheduler.)
Error-driven (AIMD) — increase rate when healthy, cut on errors. (TCP congestion control, adaptive retry.)
Latency-driven — reduce rate when p99 exceeds threshold. (Netflix concurrency limiter.)

Seen in¶

sources/2026-06-12-cloudflare-scaling-security-insights — scan scheduler recomputes its rate every 30 minutes from account/zone counts per tier, ensuring uniform scan distribution regardless of customer growth.