PATTERN Cited by 1 source

Hot cluster for iterative ML¶

Description¶

Keep compute clusters warm and provisioned between ML workflow runs rather than cold-starting a new cluster for each execution. This eliminates the provisioning delay (often 30 seconds to several minutes) that interrupts rapid iterative experimentation.

The pattern is most valuable when: - Teams are iterating rapidly (multiple runs per hour) - Cluster provisioning time is a significant fraction of total run time - The cost of idle resources is justified by developer productivity gains

Trade-offs¶

Pro	Con
Eliminates provisioning latency	Idle clusters consume resources
Enables rapid iteration cycles	Requires idle-timeout / cost-management policy
Reduces context-switching for developers	May create multi-tenant contention if shared

Seen In¶

sources/2026-06-10-atlassian-architecting-scalable-ml-platforms — ML Studio hot clusters remain active between runs, eliminating wait time for cluster provisioning

(Source: sources/2026-06-10-atlassian-architecting-scalable-ml-platforms)