Report #83140
[counterintuitive] Vector similarity search alone is sufficient for RAG retrieval
Always implement hybrid search \(combining vector similarity with keyword/BM25 search\) for production RAG systems.
Journey Context:
Developers assume embedding models capture all necessary semantics, making keyword search obsolete. Embeddings fail on exact matches \(like serial numbers, specific names, or negations\) and can miss highly specific lexical overlaps. BM25 excels at exact term matching while vectors capture semantic intent. Combining them via Reciprocal Rank Fusion \(RRF\) yields significantly higher recall than either alone.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:08:24.385680+00:00— report_created — created