CONCEPT Cited by 1 source

LLM provider commitment lock-in¶

Definition¶

LLM provider commitment lock-in is the model-upgrade friction introduced by multi-month dedicated-capacity contracts on managed LLM serving substrates (Amazon Bedrock Provisioned Throughput is the canonical example). Customers who reserve capacity for a model version become economically committed to that version through the contract term; superior models released during the term sit unused until the existing contract expires.

The wiki canonical framing comes from Slack's 2026-05-28 multi-cloud retrospective (Source: sources/2026-05-28-slack-slack-ai-the-path-to-multi-cloud):

"The Commitment Lock-in: Provisioned Throughput often required commitments of one to six months. In the fast-moving world of LLMs, where a state-of-the-art model can be superseded in weeks, these commitments effectively slowed down our ability to upgrade. Even when a superior model was released, we often chose to wait for our existing commitments to expire before migrating."

Why it matters¶

Three load-bearing properties combine into the lock-in:

Model upgrade pace exceeds commitment term. Slack's verbatim "a state-of-the-art model can be superseded in weeks" means the optimal model at the start of a 6-month contract is rarely still the optimal model at the end.
Sunk-cost economic bias. Even when a clearly-better model exists, the contractually-paid-for capacity makes migration look expensive — "we often chose to wait for our existing commitments to expire before migrating."
Capacity is per-model, not pooled. PT contracts on Bedrock are bound to a specific model SKU. Switching models doesn't re-use the existing PT capacity automatically.

Concept	Relationship
concepts/llm-model-feature-lag	Feature lag delays availability on the secondary substrate. Commitment lock-in delays upgrades on the primary substrate. Different time-axes.
concepts/llm-over-provisioning-cycle	Sibling PT failure mode: over-provisioning is the off-peak cost waste; commitment lock-in is the upgrade-cadence cost.
concepts/vendor-lock-in (general)	LLM-serving-specialised case of the broader vendor-lock-in concept; specifically the sub-pattern where contract terms slow internal-product evolution.
concepts/provisioned-throughput-vs-on-demand-llm	The PT/OD axis where commitment lock-in is the PT-side cost.
concepts/multi-cloud-llm-serving	The cross-cloud architectural resolution that breaks the lock-in path.

Why On-Demand resolves it¶

OD pricing is per-token / per-request with no commitment. Migration to a superior model is a routing-layer change — zero contract reconciliation, zero stranded capacity, zero sunk-cost bias. Slack's Phase 3 verbatim:

"more importantly, it removed the technical bottleneck we faced in Phase 2: because we were no longer locked into multi-month commitments, we regained the freedom to migrate features to different models. This meant that as soon as a more performant model dropped and passed our internal quality and metrics bars, we could pivot our infrastructure to support it within a day, rather than waiting months for a contract to expire."

The disclosed migration-cadence improvement: months → 1 day for a model swap when no PT commitment is in the way.

Trade-offs¶

OD breaks the lock-in but introduces its own failure modes (see concepts/provisioned-throughput-vs-on-demand-llm):

Service-level variability (shared-resource model).
Regional capacity orchestration dependency on the provider.
Concentration risk — heavy OD usage on one provider creates a different reliability problem.

The hybrid resolution (patterns/provisioned-throughput-with-on-demand-spillover) balances commitment-lock-in mitigation (OD for bursty workloads that change models often) against the latency consistency needs (PT for stable workloads that don't change models often).

Quantitative signature (disclosed)¶

Property	Value
PT contract term (Bedrock)	1–6 months
Model state-of-the-art turnover	"weeks"
Migration-cadence with PT	months (wait for contract expiry)
Migration-cadence without PT	1 day (per Slack Phase 3)

Seen in¶

sources/2026-05-28-slack-slack-ai-the-path-to-multi-cloud — canonical wiki disclosure of LLM provider commitment lock-in as the structural Phase 2 → Phase 3 driver complementing the over-provisioning cycle. Verbatim "weeks" state-of-the-art turnover figure
1-day post-OD migration cadence.