PATTERN

Downsample / recode for long-term archive

Intent

Before moving media into long-term retention (cold storage, Glacier, tape), re-encode or downsample it to a lower-fidelity format that preserves just enough quality for compliance and occasional-lookup access, cutting storage cost by 5–10x without losing the regulatory-required signal.

Motivating example: Amazon Connect call recordings

From the 2022-07-11 roundup (Source: sources/2022-07-11-highscalability-stuff-the-internet-says-on-scalability-for-july-11th-2022), via an AWS architecture-blog post titled Serverless architecture for optimizing Amazon Connect call-recording archival costs:

"Long term retention requirements cost a lot in storage. But you don't probably have to keep data stored in its original high fidelity encoding. Recode and downsample it to save big bucks."

Classic case: a 7-year call-recording retention requirement for financial / regulatory compliance. Calls are recorded as high-fidelity WAV; 99.9% of them will never be accessed again. Downsample to 8-kHz narrowband MP3 before the 30-day transition to cold storage: still intelligible for compliance pulls, at 1/10 the storage footprint.
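
A minimal sketch of the downsample step, assuming ffmpeg is available on the PATH; the 8-kHz mono, 16-kbps settings are one plausible narrowband target for speech, and the file names are placeholders:

```python
import subprocess

def downsample_for_archive(src_wav: str, dst_mp3: str) -> None:
    """Re-encode a high-fidelity WAV into an 8-kHz narrowband MP3."""
    subprocess.run(
        [
            "ffmpeg",
            "-y",                      # overwrite the output if it exists
            "-i", src_wav,             # original high-fidelity recording
            "-ac", "1",                # mono is enough for a voice call
            "-ar", "8000",             # 8-kHz narrowband sample rate
            "-codec:a", "libmp3lame",  # MP3 encoder
            "-b:a", "16k",             # low bitrate tuned for speech
            dst_mp3,
        ],
        check=True,                    # raise if ffmpeg fails
    )

downsample_for_archive("call-recording.wav", "call-recording-archive.mp3")
```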

Generalized pattern

Stage                           | Fidelity                                 | Cost tier
Live / hot                      | Original-quality WAV / FLAC / raw video  | S3 Standard
Recent-review window (~30 days) | Original                                 | S3 Standard / IA
Archive (~30 days – years)      | Downsampled + re-encoded                 | S3 Glacier / Deep Archive
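
The archive-tier transition itself can be a plain S3 lifecycle rule. A sketch with boto3, assuming the downsampled copies land under an archive/ prefix in a hypothetical call-recordings bucket:

```python
import boto3

s3 = boto3.client("s3")

# Move downsampled copies to Glacier Deep Archive once the ~30-day
# recent-review window has passed. Bucket name and prefix are
# placeholders for illustration.
s3.put_bucket_lifecycle_configuration(
    Bucket="call-recordings",
    LifecycleConfiguration={
        "Rules": [
            {
                "ID": "archive-downsampled-recordings",
                "Status": "Enabled",
                "Filter": {"Prefix": "archive/"},
                "Transitions": [
                    {"Days": 30, "StorageClass": "DEEP_ARCHIVE"},
                ],
            }
        ]
    },
)
```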

The re-encode step is cheap compared to the retention cost; at hot-tier storage rates the break-even is typically a few days of retained storage.
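
A back-of-envelope check of that claim, using published us-east-1 list prices at the time of writing and an assumed one-time transcode cost (the real figure depends on codec and compute):

```python
# Storage prices are us-east-1 list prices at the time of writing;
# the per-GB transcode cost is an assumption.
S3_STANDARD = 0.023     # $/GB-month, hot tier
DEEP_ARCHIVE = 0.00099  # $/GB-month, coldest tier
TRANSCODE = 0.005       # $/GB, assumed one-time re-encode compute
SHRINK = 0.10           # archive copy is ~1/10 the original size

for tier, price in [("S3 Standard", S3_STANDARD), ("Deep Archive", DEEP_ARCHIVE)]:
    saved_per_month = price * (1 - SHRINK)    # $/GB-month saved by shrinking
    days = 30 * TRANSCODE / saved_per_month   # days of storage to recoup compute
    print(f"{tier}: break-even after ~{days:.0f} days of retention")
```

Even in the worst case (data already priced at Deep Archive rates), the break-even lands well inside a seven-year retention window.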

When not to apply

  • Forensic / legal-hold data: some regulations require preservation of the original bitstream.
  • Training data for ML that may benefit from high-fidelity features (speech recognition, video understanding) — keep a high-fidelity path for model-retraining use cases and a downsampled path for compliance.