PATTERN Cited by 1 source
Slow-fast hierarchical policy¶
Pattern¶
A slow-fast hierarchical policy structures a decision system into two asynchronous layers:
- Slow Policy (Planner): Runs at a low cadence (e.g., weekly). Makes strategic decisions about constraints and targets using long-horizon signals. Writes decisions to a shared store.
- Fast Policy (Executor): Runs on every action opportunity (real-time). Makes tactical decisions (e.g., which item to select) within the bounds set by the Slow Policy. Reads the plan from the shared store as a feature.
Why it works¶
- Decouples time horizons: Slow layer explicitly manages long-term effects (fatigue, churn) that fast optimization ignores.
- Stickiness: Members receive consistent experience over the planning window.
- Independent evolution: Each layer can be retrained, A/B tested, or replaced without touching the other.
- Clean separation of concerns: "how often" vs. "which one" become independent optimization problems.
Communication mechanism¶
A low-latency feature store bridges the two policies asynchronously — no synchronous coupling, no shared inference path. The Slow Policy writes strategic intent; the Fast Policy reads it as an input feature.
Seen in¶
- sources/2026-06-19-netflix-thinking-fast-slow-for-a-personalized-notification-system — Netflix's notification system: Slow Policy sets personalized weekly pacing plan from ~O(100) channel-frequency combinations; Fast Policy selects optimal message at each opportunity. Produced largest production metric lift to date, especially for infrequent viewers.