Skip to content

CONCEPT Cited by 1 source

Adaptive upload parallelism

Definition

Adaptive upload parallelism is the property of a write pipeline that automatically adjusts the number of concurrent upload streams based on current throughput, without operator tuning. The system converges on fewer streams (larger batches, fewer PUTs, lower cost) under light load and splits into more streams (smaller batches, higher throughput, acceptable latency) under heavy load.

This is distinct from static parallelism settings (fixed concurrency per broker) or manually configured thresholds. The adaptation happens continuously and bidirectionally (Source: sources/2026-06-18-redpanda-adaptive-write-request-scheduling).

Why it matters

Object-storage-backed streaming systems face a three-way tension: - Cost: Each S3 PUT costs money regardless of payload size. Fewer PUTs = lower cost. - Latency: Waiting to fill large batches adds delay. More streams = more chances to hit size threshold before time threshold. - Throughput ceiling: A single upload stream caps at one in-flight PUT; under heavy load this becomes the bottleneck.

A static setting optimizes for one operating point; adaptive parallelism optimizes across the full range.

Seen in

Last updated · 542 distilled / 1,571 read