CONCEPT Cited by 2 sources

Feature freshness¶

Definition¶

Feature freshness is the service-level property describing how recently a feature value in an online store reflects the underlying signal that produced it. If a user opens a document at T, a feature derived from "documents the user recently interacted with" should include that document as of some bounded interval T+Δ. The Δ is the freshness.

Stated in the Dropbox Dash feature-store post:

"Stale features can lower ranking quality and hurt user experience, so our feature store had to reflect new signals as soon as possible, often within minutes of user actions."

"Relevance also depends on speed and capturing user intent in real-time. If a user opens a document or joins a Slack channel, that signal should show up in their next search—within a few seconds." (Source: sources/2025-12-18-dropbox-feature-store-powering-real-time-ai-dash)

Why it's a first-class concern¶

Freshness is co-equal with latency in ranking-quality-driven systems:

Latency without freshness = fast answers based on stale signals → wrong ranking order.
Freshness without latency = correct ranking order, but retrieved too slowly for the user.
Both are required; the architecture has to hit both.

Unlike raw database replication lag, freshness isn't just "when did the last write land" — it's the end-to-end time from the event at the source system to the derived feature value being served, including:

Event emission (a doc opens, a user joins a channel).
Capture at source (CDC, event bus, log).
Ingestion into the feature pipeline.
Feature transformation.
Write to the online store.
Availability to the next read.

Each step has its own cost/latency trade-off — covered by the three lanes of patterns/hybrid-batch-streaming-ingestion.

Three ingestion lanes map to three freshness tiers¶

Batch — minutes-to-hours, amenable to change detection; good for heavy joins/aggregations over historical windows.
Streaming — seconds-to-minutes, near-real-time; good for collaboration/interaction signals that must surface "in their next search."
Direct writes — seconds, for precomputed features produced by an adjacent pipeline (e.g. LLM evaluation scores) that skips batch entirely.

Not every feature needs the same freshness. "User opened doc X 2 seconds ago" is a streaming feature; "document embedding" can tolerate hours of staleness. A feature-store design that forces every feature onto the same ingestion path either over-invests in freshness (cost) or under-delivers on relevance.

Measurement¶

The Dropbox post doesn't quantify freshness numerically — it names "within minutes" as the bar for batch features and "within a few seconds" as the bar for real-time signals, and leaves the impact on ranking quality ("lower ranking quality") unquantified.

A rigorous freshness SLO tracks p50 / p95 / p99 of end-to-end delay for each feature, binned by source signal, and ties it to an offline-measured delta in ranking quality (NDCG or similar) as a function of staleness. This post doesn't disclose those numbers but is directionally consistent with the discipline.

concepts/feature-store — feature freshness is one of the dominant design axes.
patterns/hybrid-batch-streaming-ingestion — the architectural pattern that lets different features hit different freshness tiers.
patterns/change-detection-ingestion — the optimization that makes the batch lane's freshness bar achievable at cost.
systems/dash-feature-store — the canonical Dropbox instance.

Seen in¶

sources/2025-12-18-dropbox-feature-store-powering-real-time-ai-dash — canonical wiki introduction; "within minutes" / "within a few seconds" freshness bars for Dash ranking.
sources/2026-01-06-lyft-feature-store-architecture-optimization-and-evolution — Lyft's Feature Store frames freshness in qualitative terms ("ultra-low-latency" cache, "near-real time" streaming, "generous TTL" on the ValKey write-through layer) rather than named numbers, but the three-lane architecture exists specifically to support per-feature freshness tiering: batch (daily) for heavy history, streaming (seconds) via Flink for interaction signals, on-demand SDK CRUD (seconds) for adjacent-producer features.