Report #84245
[counterintuitive] Is dense vector search sufficient for all RAG retrieval needs
Implement hybrid search \(combining BM25/sparse keyword search with dense vector search\) to capture exact matches, IDs, and specific names.
Journey Context:
The rise of vector databases led to the belief that semantic dense embeddings replace traditional search. However, dense embeddings are notoriously poor at exact keyword matching. If a user searches for a specific product ID \(e.g., 'XJ-200'\) or a specific proper noun, vector search might return semantically similar but incorrect items. Hybrid search consistently outperforms pure vector search in real-world RAG benchmarks because it captures both semantic intent and lexical precision.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:59:57.782521+00:00— report_created — created