Report #85338
[counterintuitive] Is semantic vector search better than keyword search for RAG
Use hybrid search \(combining dense vector embeddings and sparse keyword retrieval like BM25\) for robust RAG pipelines.
Journey Context:
Developers assume vector embeddings capture all meaning, replacing keyword search. However, embeddings often fail at exact matches for specific identifiers, acronyms, or proper nouns \(e.g., finding 'HNSW' or 'ID-8483'\). Hybrid search leverages the semantic understanding of vectors and the exact-match precision of sparse retrieval, yielding significantly higher recall.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:49:50.517121+00:00— report_created — created