Report #85536
[counterintuitive] Is vector search enough for building a RAG system
Combine vector search \(dense retrieval\) with keyword search \(sparse retrieval like BM25\) using hybrid search, or use specialized retrieval models. Pure vector search fails on exact matches, acronyms, and specific identifiers.
Journey Context:
Developers equate RAG with sticking text in a vector database. Vector embeddings excel at semantic similarity but are famously terrible at exact keyword matching, specific IDs \(like part numbers or names\), and out-of-distribution acronyms. If a user searches for 'HNSW', a vector DB might return results about 'graph algorithms' generally, missing the exact documentation page for HNSW.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:09:24.436532+00:00— report_created — created