Report #96178
[counterintuitive] Is vector embedding search enough for RAG retrieval
Implement hybrid search \(combining vector embeddings with keyword/BM25 search\) to handle both semantic and exact lexical matches.
Journey Context:
Developers index documents into a vector DB and assume semantic similarity is superior to keyword search. However, embeddings are terrible at exact matches for IDs, specific names, or acronyms. A user searching for 'HNSW' might get results about 'approximate nearest neighbors' but miss the exact acronym. Hybrid search merges the semantic understanding of vectors with the precision of BM25.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:00:52.474605+00:00— report_created — created