Report #65822
[counterintuitive] semantic search with embeddings is sufficient for RAG
Implement a hybrid search \(combining keyword/BM25 and vector search\) and use a cross-encoder reranker to improve retrieval precision.
Journey Context:
Developers assume dense vector embeddings capture all necessary semantics, replacing keyword search. However, embeddings struggle with exact matches \(names, IDs, acronyms\) and out-of-domain terminology. A user searching for 'HNSW' might get results about 'graph algorithms' generally. BM25 excels at exact token matching. Hybrid search combines the semantic breadth of vectors with the precision of keywords, and reranking resolves the semantic mismatch.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:57:41.935127+00:00— report_created — created