PATTERN Cited by 2 sources
Workload-segregated clusters¶
Definition¶
Workload-segregated clusters is the operational pattern of running dedicated clusters per workload shape instead of a single "one-size-fits-all" cluster handling every workload type. Each cluster's configuration is tuned to the concurrency, query-complexity, and resource profile of its workload; a query gateway (or equivalent routing layer) in front of the fleet matches each incoming query to the cluster whose shape fits it best.
The canonical three shapes (Trino / Presto world)¶
From sources/2026-03-24-expedia-operating-trino-at-scale-with-trino-gateway — Expedia names this as "a common pattern for organizations using Trino at scale":
Adhoc clusters¶
- Mixed-shape workloads; medium concurrency.
- Varied query complexity, simple → moderate.
- Used for exploratory analysis and development.
- "Versatile, supporting mixed workloads that range from simple to moderately complex queries. They provide a balanced environment for exploratory data analysis and development tasks."
ETL clusters¶
- High-volume, highly complex queries with low concurrency.
- Fine-tuned for heavy data processing tasks (data integration, transformation, cleansing, enrichment).
- Optimised for few very large queries, not many small ones.
- "The primary goal is to prepare optimized datasets for downstream consumption."
BI clusters¶
- Low-complexity queries with high concurrency.
- May be configured to refresh pre-aggregated data.
- Support BI tools like Tableau and Looker where many users hit dashboards simultaneously.
- "High concurrency ensures that multiple users can access dashboards and reports simultaneously without performance issues."
A fourth shape sometimes carved out:
Metadata clusters¶
- Single-node, extremely lightweight.
- Serves metadata queries (
select version(),show catalogs) that BI tools run as health checks. - Segregating these off main clusters protects dashboard extract flows from failures cascading out of big-query contention.
Why segregate instead of over-provision one big cluster¶
A single cluster hosting all three shapes has to be sized for the intersection of their requirements:
- ETL's memory ceiling per query (huge).
- BI's concurrency ceiling (huge).
- Adhoc's unpredictability (wide).
The cluster either gets over-provisioned for all three (expensive and still not-quite-right for any) or gets tuned to one shape and suffers interference from the others (tail latency, cascading failures, queue-behind effects).
Key interference cases named in the post:
"Queries against massive tables can cause significant delays for smaller queries on shared clusters. By routing large table queries to specialized clusters, organizations can ensure that smaller queries execute without being queued behind resource-intensive operations."
"Metadata queries such as
select version()orshow catalogsare frequently run by BI tools to check cluster health. Failures in these queries can lead to subsequent extract failures. By implementing routing rules that direct metadata queries to a lightweight, single-node Trino cluster, organizations can reduce extract failure rates and improve dashboard load times."
Segregation + routing is the cheaper, more reliable, more predictable alternative. Each cluster is tuned narrowly; tail behavior of one workload cannot spill into another.
Required complement: a routing layer¶
Workload-segregated clusters are user-visible without a routing layer — each user has to know which cluster to connect to. This does not scale; the pattern always pairs with a query gateway or equivalent routing layer that hides the segregation from users. The gateway exposes one URL; routing rules pick a cluster per query.
Related patterns outside the SQL-query-engine world¶
- Tenant isolation — segregation by tenant identity instead of workload shape; often overlaps in multi-tenant analytics platforms.
- Kafka broker tier segregation — separate broker groups per topic class (high-throughput ingest vs. low-throughput control-plane).
- Dedicated GPU pools — training pools vs. inference pools, often with workload-aware routing on top.
Seen in¶
-
— applies the pattern at the OLTP/OLAP workload-archetype axis rather than at the within-analytics-engine axis (Expedia's Trino shapes) or the metrics-storage axis (Airbnb). Savannah Longoria (PlanetScale, 2023-12-01) canonicalises the workload-segregation principle as PlanetScale's architectural recommendation when HTAP is off the table: "If you have a complex application with distinct transactional and analytical workloads that can be separated, then it may be more appropriate to use separate databases for each workload. This approach allows each database to be optimized for its specific workload and can provide better performance and scalability." The fundamental mechanism is physical resource isolation — don't share CPU / memory / I/O bandwidth across workload classes. Operationally this becomes a PlanetScale OLTP cluster paired with a specialised OLAP warehouse (Snowflake / BigQuery / Redshift), connected by ETL engines like Airbyte, Fivetran, or Stitch — the CDC driver ecosystem is the routing-layer complement at the database-workload tier.
-
sources/2026-03-24-expedia-operating-trino-at-scale-with-trino-gateway — Expedia's Adhoc / ETL / BI segregation fronted by Trino Gateway.
- sources/2026-04-21-airbnb-building-a-fault-tolerant-metrics-storage-system — Airbnb applies the pattern to a metrics-storage fleet rather than a query-engine fleet: dedicated clusters for specialized workloads (compute-infra, mesh-infra) separated from general application clusters, with a Promxy-based federation proxy presenting the unified view. The segregation is driven by blast-radius concerns plus workload-shape differences (infra-metric cardinality profiles vs. application metrics). Sequenced rollout is layered on top — see patterns/progressive-cluster-rollout.
Related¶
- patterns/query-gateway — the required routing complement.
- concepts/workload-aware-routing — the routing discipline.
- patterns/routing-rules-as-config — how the routing layer is configured.
- concepts/tenant-isolation — adjacent segregation axis.
- concepts/active-multi-cluster-blast-radius — the reliability- complement axis that usually goes alongside workload segregation.
- concepts/shuffle-sharding — the sub-cluster tenant-isolation complement.
- patterns/progressive-cluster-rollout — deploy-sequencing complement for multi-cluster fleets.
- systems/trino — canonical query engine for this pattern.
- systems/airbnb-metrics-storage — applied to a metrics-storage fleet rather than a query engine.