Skip to content

CONCEPT Cited by 1 source

Metaflow extension mechanism

The Metaflow extension mechanism is the published (but not-yet- stable) API that lets organisations plug company-specific integrations into Metaflow — new compute backends, dependency managers, orchestrators, deployment targets — without forking the framework. It is the seam along which Metaflow's foundational platform posture holds together.

What it enables

At Netflix, the extension mechanism is where the following integrations live (Source: sources/2024-07-22-netflix-supporting-diverse-ml-systems-at-netflix):

  • @titus decorator — dispatches steps to Titus instead of AWS Batch / Kubernetes.
  • Fast Data — Netflix's Iceberg + Arrow + high-throughput S3 library, exposed as metaflow.Table + metaflow.MetaflowDataFrame.
  • Maestro integration — dispatches workflows to Maestro instead of AWS Step Functions / Argo / Airflow.
  • metaflow.Cache — precomputed-predictions KV interface.
  • metaflow.Hosting — decorator-driven REST serving.
  • Portable execution environmentsmetaflow environment CLI + execution-time env fetch (see concepts/portable-execution-environment).

Public template

The open-source template is at github.com/Netflix/metaflow-extensions-template, and Netflix's own extensions package lives at github.com/Netflix/metaflow-nflx-extensions.

Stability caveat

Netflix's post is explicit:

"These integrations are implemented through Metaflow's extension mechanism which is publicly available but subject to change, and hence not a part of Metaflow's stable API yet. If you are curious about implementing your own extensions, get in touch with us on the Metaflow community Slack."

Organisations building on the extension mechanism take on migration risk when the API stabilises; the trade-off is access to capabilities — portable envs, custom compute / orchestrator backends — that aren't yet available through the stable API.

Why it matters

Without this seam, every Netflix-specific integration would require either (a) a Netflix fork of Metaflow, or (b) ad-hoc shims around Metaflow in each team's flow — recreating the "unsustainable operational overhead" that Metaflow's foundational-platform posture exists to avoid.

Seen in

Last updated · 550 distilled / 1,221 read