SYSTEM Cited by 1 source

Meta Groups Scoped Search¶

Meta Groups Scoped Search — more precisely the group-scoped discussions module on Facebook Search — is the product surface that surfaces content from Facebook Groups when a user searches a topic (e.g. "tips for taking care of snake plants" → snake-plant group discussions). Disclosed in the 2026-04-21 Meta Engineering post as the system Meta re-architected from pure keyword retrieval onto a hybrid (lexical + dense-semantic) pipeline.

Architecture¶

Three-layer pipeline per the 2026-04-21 post:

Query preprocessing — tokenization, normalization, query rewriting feeding both retrieval arms.
Parallel retrieval:
Lexical: Unicorn inverted index for exact/prefix matches on proper nouns + specific quotes.
Semantic: SSR (12-layer 200M-param) encoder → dense query vector → Faiss ANN over a precomputed vector index of group posts.
L2 ranker — MTML supermodel with TF-IDF/BM25 lexical features + cosine-similarity semantic features; jointly optimises clicks, shares, comments.

Quality gated at CI/build time by a Llama 3 multimodal judge in the BVT pipeline grading on a three-tier rubric (exact-match / somewhat-relevant / irrelevant).

User-experience problems addressed¶

Post frames three friction points the re-architecture targets:

Discovery — the "small individual cakes with frosting" → "cupcakes" gap that keyword retrieval cannot bridge.
Consumption — the "effort tax" of scrolling through many comments to find consensus; surfacing high-quality community content at the top reduces it.
Validation — using community expertise to validate decisions (the Marketplace vintage-Corvette example in the post).

Outcomes¶

"Tangible improvements in search engagement and relevance, with no increase in error rates."
The L2 Model + EBR (Hybrid) configuration outperforms the lexical-only baseline on daily-users-performing-search.
No quantitative lift numbers, QPS, or latency disclosed.

Future work¶

LLMs directly in ranking — process post content during ranking, not just embedding-space similarity.
Adaptive retrieval — LLM-driven dynamic adjustment of retrieval parameters based on query complexity.

Seen in¶

sources/2026-04-21-meta-modernizing-facebook-groups-search