Report #69156
[counterintuitive] Using pure vector search for RAG retrieval
Implement hybrid search \(combining dense vector embeddings with sparse keyword retrieval like BM25\) for production RAG systems.
Journey Context:
Developers assume dense embeddings capture all semantic meaning and render keyword search obsolete. However, vector search performs poorly on exact matches for IDs, specific names, or out-of-vocabulary acronyms. Hybrid search leverages the exact matching of sparse retrieval and the semantic matching of dense retrieval, significantly improving recall and reducing missed retrievals.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:33:51.866867+00:00— report_created — created