SYSTEM Cited by 5 sources

Presto

Definition

Presto is the original distributed SQL query engine from Facebook (open-sourced in 2013). After a governance schism in 2018-2019 it split into two forks, PrestoDB (under the Linux Foundation / Presto Foundation) and PrestoSQL, with the latter renamed Trino in December 2020.

Role in this wiki

Presto appears primarily as the historical predecessor of Trino. Named explicitly in sources/2026-03-24-expedia-operating-trino-at-scale-with-trino-gateway: "Trino — a fork of PrestoSQL — is a powerful tool in modern data analytics."

Its other relevance here is as the ancestor of the gateway pattern: Lyft built Presto Gateway as a proxy and load balancer for PrestoDB, and that gateway was later forked and integrated into the Trino ecosystem as Trino Gateway. The cluster-segregation + workload-aware-routing architecture Expedia describes behind Trino Gateway therefore predates the Trino rename — it is a Presto-era pattern that the Trino ecosystem inherited.
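
The cluster-segregation and workload-aware-routing idea can be illustrated with a minimal sketch. This is not the Presto Gateway or Trino Gateway API: the `Cluster` and `GatewayRouter` classes, the group names "adhoc" and "etl", and the rule that a source tag starting with "airflow" marks a scheduled job are all hypothetical, chosen only to show the shape of the pattern (pick a cluster group from query metadata, then balance within that group).

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Cluster:
    name: str
    group: str            # hypothetical workload group, e.g. "adhoc" or "etl"
    healthy: bool = True


class GatewayRouter:
    """Toy router: choose a cluster group from query metadata, then round-robin."""

    def __init__(self, clusters: List[Cluster], default_group: str) -> None:
        self.clusters = clusters
        self.default_group = default_group
        self._cursor: Dict[str, int] = {}   # per-group round-robin counter

    def choose_group(self, source: str) -> str:
        # Hypothetical routing rule: scheduled jobs go to the "etl" group,
        # everything else falls through to the default group.
        if source.startswith("airflow"):
            return "etl"
        return self.default_group

    def route(self, source: str) -> Cluster:
        group = self.choose_group(source)
        # Segregation: only healthy clusters in the chosen group are candidates.
        candidates = [c for c in self.clusters if c.group == group and c.healthy]
        if not candidates:
            raise RuntimeError(f"no healthy cluster in group {group!r}")
        i = self._cursor.get(group, 0)
        self._cursor[group] = i + 1
        return candidates[i % len(candidates)]
```

In this sketch, interactive traffic never lands on the "etl" clusters and vice versa, and an unhealthy cluster is simply skipped. A production gateway would also need sticky routing for follow-up requests of the same query, which is omitted here.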

Seen in

Known large-scale deployments

  • Meta (internal): "tens of thousands of machines"; every query is routed through an internal Gateway; canary + shadow cluster deployment pipeline; automated cluster standup/decommission wired into the data-warehouse hardware pipeline. See sources/2023-07-16-highscalability-lessons-learned-running-presto-at-meta-scale.
  • Lyft (historical): built the original open-source Presto Gateway, which was later forked to become Trino Gateway.
  • Pinterest (internal): named as a query substrate for Piqama's auto-rightsizing service: "a separate auto-rightsizing service to continuously consume historical data from various sources, including Presto, Iceberg, and user-defined data sources." Canonical wiki instance of Presto used as the analytics front-end for platform-telemetry feedback loops (historical-usage auto-rightsizing) over Iceberg on S3. Also the execution engine underneath the Analytics Agent: the agent's four-layer architecture pushes LLM-generated SQL through Presto with EXPLAIN-before-EXECUTE validation + bounded retry + default LIMIT 100.
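
The EXPLAIN-before-EXECUTE, bounded-retry, and default-LIMIT guardrails mentioned for the Analytics Agent can be sketched roughly as follows. This is an illustrative assumption, not Pinterest's implementation: `generate_sql` (a callable that returns a SQL string, optionally given the previous error message) and `run_sql` (a callable that sends a statement to Presto and returns rows) are hypothetical stand-ins, the retry count of 3 is arbitrary, and the LIMIT detection is a deliberately naive regex.

```python
import re
from typing import Callable, List, Optional

DEFAULT_LIMIT = 100
# Naive check: does the statement already end in "LIMIT <n>"?
_LIMIT_RE = re.compile(r"\blimit\s+\d+\s*$", re.IGNORECASE)


def apply_default_limit(sql: str, limit: int = DEFAULT_LIMIT) -> str:
    """Append a LIMIT clause unless the statement already ends with one."""
    stripped = sql.strip().rstrip(";").rstrip()
    if _LIMIT_RE.search(stripped):
        return stripped
    return f"{stripped} LIMIT {limit}"


def explain_then_execute(
    generate_sql: Callable[[Optional[str]], str],   # hypothetical LLM wrapper
    run_sql: Callable[[str], List[tuple]],          # hypothetical Presto client call
    max_attempts: int = 3,
) -> List[tuple]:
    """Validate each candidate with EXPLAIN; execute only the first that passes."""
    last_error: Optional[str] = None
    for _ in range(max_attempts):
        sql = apply_default_limit(generate_sql(last_error))
        try:
            # EXPLAIN plans the statement without running it, so syntax and
            # semantic errors surface before any data is scanned.
            run_sql(f"EXPLAIN {sql}")
        except Exception as exc:
            last_error = str(exc)   # fed back to the generator on the next try
            continue
        return run_sql(sql)
    raise RuntimeError(f"gave up after {max_attempts} attempts: {last_error}")
```

Feeding the previous error back into the generator is one plausible way to make the retries useful. The bound on attempts keeps a persistently failing prompt from looping indefinitely.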
