CONCEPT Cited by 1 source
Relative Score Fusion (RSF)¶
Definition¶
Relative Score Fusion (RSF) is a score-fusion technique for combining result lists from multiple independent retrieval methods into a single ranked list, using the raw scores from each retriever directly — after per-retriever normalization to put the scales on common ground. A typical form:
$$ \text{RSF}(d) = \sum_{r \in R} w_r \cdot \text{norm}_r(\text{score}_r(d)) $$
where $w_r$ is a per-retriever weight (tunable), and $\text{norm}_r$ is a per-retriever normalization (min-max, z-score, or sigmoid) that maps that retriever's scores to a shared target range (typically [0, 1]).
MongoDB's 2025-09-30 framing:
"[RSF] works directly with raw scores from different sources of relevance, using normalization to minimize outliers and align modalities effectively at a more granular level than rank alone can provide."
Why score-based instead of rank-based¶
Raw scores, when properly calibrated, encode confidence magnitude — information that pure rank-based methods like RRF discard:
- A BM25 score of 42.7 carries more information than "rank #1" — it quantifies how strong the keyword match actually is.
- A cosine-similarity score of 0.97 vs 0.62 at the same rank position tells you how confident the embedding model is.
- A retriever returning 10 documents all with scores near 0.01 is signalling low relevance across the board — rank alone erases this.
RSF keeps that magnitude and lets the fusion score reflect both cross-retriever consensus and per-retriever confidence.
The normalization problem¶
The tradeoff vs RRF is that raw scores across retrievers are incomparable by default — fixing that is the engineering work:
| Normalization | What it does | Good at | Weakness |
|---|---|---|---|
| Min-max | (s - min) / (max - min) per retriever's result set |
Simple, bounded [0,1] |
Sensitive to outliers; a single extreme-high score compresses the rest |
| Z-score | (s - μ) / σ per retriever |
Outlier-resistant | Doesn't bound the output; negative values |
| Sigmoid / tanh | 1 / (1 + exp(-s)) with per-retriever scale |
Bounded + smooth | Requires per-retriever scale calibration |
| Quantile / percentile | Map raw score to rank-percentile | Robust to any distribution | Loses absolute-magnitude info (approaches RRF) |
MongoDB's framing — "to minimize outliers and align modalities effectively at a more granular level than rank alone" — points at the middle-ground: normalizations that resist outliers (unlike naive min-max) but preserve magnitude information (unlike quantile/RRF).
Properties¶
- Preserves score-magnitude information. Good for retrievers with calibrated scores (BM25 is calibrated-enough in practice; cosine similarity is bounded; vector dot products often are too).
- Weighted — expressive. Per-retriever weights let you bias toward one modality (e.g.
α = 0.7toward vector search when the corpus is paraphrase-heavy). - Tunable per-workload. Offline eval with NDCG gives a principled way to tune weights and normalization choice.
Limitations¶
- Normalization is a hyperparameter. Wrong choice (min-max vs z-score vs sigmoid) distorts fusion — choice matters and doesn't have a universal default.
- Outlier-sensitive without careful design. Min-max in particular fails when one retriever emits a score blowout — one result dominates.
- Requires calibration effort. If one retriever's scores drift (embedding model update, BM25 parameter retune), the fusion needs re-tuning.
- More hyperparameters than RRF. RRF has one knob (
k); RSF has per-retriever weight + normalization method + (often) per-retrieverα.
Where it's used¶
- MongoDB Atlas's native hybrid search function. RSF is the second of the two canonical fusion techniques MongoDB's 2025-09-30 post identifies alongside RRF: "Both approaches quickly gained traction and have become standard techniques in the market."
- Weaviate's hybrid search (named "relativeScoreFusion" in their API) — the default historically was RRF; RSF is a first-class option since 1.20.
- Many custom production stacks — ad-hoc weighted-sum fusion with min-max normalization is a common DIY shape that is, by MongoDB's taxonomy, an RSF variant.
- Figma's score-fusion — min-max per index + exact-match boost + interleave; the boost-plus-normalize shape is a specific RSF realization sitting beside generic RSF in the design space.
When to pick RSF over RRF¶
- Retrievers with well-calibrated raw scores — cosine similarity, Figma-style min-max-normalized lexical + vector, learned retrievers trained with probability outputs.
- Workloads where fine-grained rank differentiation matters — user-facing search where you need to rank item 4 vs item 5 correctly, not just "both in top 10".
- When one modality is measurably stronger — the weighted-sum form makes the bias explicit and tunable.
RSF loses to RRF on: new retrievers that haven't been score-calibrated, heterogeneous retriever mixes (BM25 + vector + rules + LTR), and teams without eval infrastructure to tune the weights.
Seen in¶
- sources/2025-09-30-mongodb-top-considerations-when-choosing-a-hybrid-search-solution — MongoDB names RSF alongside RRF as one of the two standard fusion techniques: "RSF, on the other hand, works directly with raw scores from different sources of relevance, using normalization to minimize outliers and align modalities effectively at a more granular level than rank alone can provide."
Related¶
- concepts/reciprocal-rank-fusion — the rank-based alternative; complementary design point on the fusion-technique axis.
- concepts/hybrid-retrieval-bm25-vectors — the retrieval stack RSF fuses over.
- concepts/hybrid-search — namespace-collision page (vector + metadata filter); RSF applies to fusion of multi-retriever hybrid retrieval.
- patterns/hybrid-lexical-vector-interleaving — Figma's specific RSF-family realization with min-max + exact-match boost + interleave.
- patterns/native-hybrid-search-function — the productization pattern where RSF is exposed as a database primitive.
- systems/atlas-hybrid-search — MongoDB's native hybrid-search function that exposes RSF-style fusion.
- systems/bm25 — one of the two retrievers most commonly fused via RSF.