SYSTEM Cited by 2 sources
Amazon Redshift Spectrum
Amazon Redshift Spectrum is the query-against-S3 extension of Amazon Redshift: it runs SQL directly over data in S3 without loading it into the cluster, using external tables defined in the Glue Data Catalog. Its substrate and capability are similar to those of Amazon Athena, which is fully serverless; Redshift Spectrum instead works through an existing Redshift cluster, so S3 data can be joined against warehouse-resident data.
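As a rough illustration of the external-table mechanism described above, the following Python sketch builds the two SQL statements typically involved: one that maps a Glue Data Catalog database into Redshift as an external schema, and one that joins an S3-backed external table against a warehouse-resident table. All identifiers (schema, database, table, and column names, and the IAM role ARN) are hypothetical placeholders, not taken from any cited source, and the sketch only produces SQL text rather than connecting to a cluster.

```python
# Sketch only: every identifier used here (schema, Glue database, role ARN,
# table and column names) is a hypothetical placeholder.
# Identifiers are interpolated directly, so pass trusted values only.


def external_schema_ddl(schema: str, glue_database: str, iam_role_arn: str) -> str:
    """Build DDL that exposes a Glue Data Catalog database as an external schema."""
    return (
        f"CREATE EXTERNAL SCHEMA IF NOT EXISTS {schema}\n"
        f"FROM DATA CATALOG\n"
        f"DATABASE '{glue_database}'\n"
        f"IAM_ROLE '{iam_role_arn}';"
    )


def join_query(schema: str, external_table: str, local_table: str, key: str) -> str:
    """Build a query joining an S3-backed external table to a local Redshift table."""
    return (
        f"SELECT w.{key}, COUNT(*) AS event_count\n"
        f"FROM {schema}.{external_table} AS s\n"
        f"JOIN {local_table} AS w ON w.{key} = s.{key}\n"
        f"GROUP BY w.{key};"
    )


if __name__ == "__main__":
    print(external_schema_ddl(
        "spectrum",
        "events_db",
        "arn:aws:iam::123456789012:role/example-spectrum-role",
    ))
    print()
    print(join_query("spectrum", "risk_events", "public.facilities", "facility_id"))
```

The generated statements would be run through whatever SQL client is already connected to the cluster. The external table itself is assumed to already exist in the Glue database, so no table DDL is shown.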
Stub — expand as dedicated sources arrive.
Seen in
- sources/2026-04-01-aws-automate-safety-monitoring-with-computer-vision-and-generative-ai: the BI team queries risk-event data sitting in S3 via Redshift Spectrum "without the hassle of moving or loading it elsewhere, making historical analysis fast and straightforward." These queries power QuickSight dashboards for facility violation hotspots, shift-to-shift trends, and cross-facility performance comparison.
- sources/2025-05-27-yelp-revenue-automation-series-testing-an-integration-with-third-party-system: the canonical example of Spectrum used to bypass verification latency. Yelp's Staging Pipeline writes Revenue Data Pipeline output to AWS Glue tables on S3, where it is immediately queryable via Redshift Spectrum. Daily integrity-check SQL runs against the staging output via Spectrum, while monthly checks run against Redshift-resident billing-system data. Spectrum serves as the "querying substrate when the Redshift Connector's ~10-hour latency would block same-day verification", a use case distinct from BI dashboarding.