Report #96955
[counterintuitive] Is vector similarity search enough for RAG retrieval
Combine vector search with keyword/lexical search \(hybrid search\) and use a reranker model to score the final top-k results.
Journey Context:
Developers assume cosine similarity on embeddings captures all semantic and lexical relevance. Embeddings are lossy compressions optimized for general semantic similarity, often missing exact keyword matches \(like specific IDs, names, or error codes\) or nuanced query-document relevance. Hybrid search \(BM25 \+ Vector\) and cross-encoder reranking significantly outperform pure vector search for production RAG systems.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:19:21.671685+00:00— report_created — created