SYSTEM Cited by 1 source
Meta Groups Scoped Search¶
Meta Groups Scoped Search — more precisely the group-scoped discussions module on Facebook Search — is the product surface that surfaces content from Facebook Groups when a user searches a topic (e.g. "tips for taking care of snake plants" → snake-plant group discussions). Disclosed in the 2026-04-21 Meta Engineering post as the system Meta re-architected from pure keyword retrieval onto a hybrid (lexical + dense-semantic) pipeline.
Architecture¶
Three-layer pipeline per the 2026-04-21 post:
- Query preprocessing — tokenization, normalization, query rewriting feeding both retrieval arms.
- Parallel retrieval:
- Lexical: Unicorn inverted index for exact/prefix matches on proper nouns + specific quotes.
- Semantic: SSR (12-layer 200M-param) encoder → dense query vector → Faiss ANN over a precomputed vector index of group posts.
- L2 ranker — MTML supermodel with TF-IDF/BM25 lexical features + cosine-similarity semantic features; jointly optimises clicks, shares, comments.
Quality gated at CI/build time by a Llama 3 multimodal judge in the BVT pipeline grading on a three-tier rubric (exact-match / somewhat-relevant / irrelevant).
User-experience problems addressed¶
Post frames three friction points the re-architecture targets:
- Discovery — the "small individual cakes with frosting" → "cupcakes" gap that keyword retrieval cannot bridge.
- Consumption — the "effort tax" of scrolling through many comments to find consensus; surfacing high-quality community content at the top reduces it.
- Validation — using community expertise to validate decisions (the Marketplace vintage-Corvette example in the post).
Outcomes¶
- "Tangible improvements in search engagement and relevance, with no increase in error rates."
- The L2 Model + EBR (Hybrid) configuration outperforms the lexical-only baseline on daily-users-performing-search.
- No quantitative lift numbers, QPS, or latency disclosed.
Future work¶
- LLMs directly in ranking — process post content during ranking, not just embedding-space similarity.
- Adaptive retrieval — LLM-driven dynamic adjustment of retrieval parameters based on query complexity.
Seen in¶
Related¶
- systems/meta-ssr-search-semantic-retriever · systems/meta-unicorn-inverted-index · systems/faiss · systems/llama-3
- concepts/hybrid-retrieval-bm25-vectors · concepts/multi-task-multi-label-ranking
- patterns/decoupled-parallel-retrieval-pipelines · patterns/llm-judge-in-build-verification-test
- companies/meta