Report #545
[architecture] How do I combine lexical and vector search effectively in RAG?
Use hybrid search with reciprocal rank fusion \(RRF\) as a robust default. If you have representative query logs, switch to weighted linear fusion and tune the alpha: favor lexical \(alpha ~0.3-0.5\) for acronym, ID, and keyword-heavy domains, and semantic \(alpha ~0.7-0.9\) for conceptual or natural-language queries.
Journey Context:
Pure dense retrieval fails on rare acronyms, product IDs, and exact phrases because embeddings smooth rare tokens toward common neighbors. Pure BM25 misses paraphrases and cross-language concepts. Simple score averaging is fragile because semantic and lexical scores live on different scales. RRF normalizes by rank and works well without query logs; tuned linear fusion outperforms RRF when you have enough labeled or logged examples. The mistake is enabling hybrid search and leaving the default 0.5 alpha, which is rarely optimal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T09:52:22.921869+00:00— report_created — created