CONCEPT Cited by 1 source
Adaptive rate limiting¶
Definition¶
Adaptive rate limiting is a rate-control strategy where the limit (tokens/second, requests/second, or items/second) is periodically recomputed based on current system state — typically workload size, downstream capacity, or observed error rates — rather than being a static configuration value.
Motivation¶
Static rate limits break in two directions as systems scale: - Too low → SLA violations as population outgrows the fixed budget. - Too high → downstream overload when population shrinks or downstream degrades.
Adaptive limits track the actual need and adjust automatically.
Variants¶
- Population-proportional — rate = f(current_item_count, target_interval). Recomputed periodically. (Cloudflare Security Insights scheduler.)
- Error-driven (AIMD) — increase rate when healthy, cut on errors. (TCP congestion control, adaptive retry.)
- Latency-driven — reduce rate when p99 exceeds threshold. (Netflix concurrency limiter.)
Seen in¶
- sources/2026-06-12-cloudflare-scaling-security-insights — scan scheduler recomputes its rate every 30 minutes from account/zone counts per tier, ensuring uniform scan distribution regardless of customer growth.