Report #90147
[counterintuitive] Vector similarity search is sufficient for RAG retrieval
Combine vector search with keyword/lexical search \(hybrid search\) and implement metadata filtering for precise RAG pipelines.
Journey Context:
The belief is that semantic embeddings perfectly capture intent. In reality, pure vector similarity fails on exact matches \(like product IDs, specific names, or error codes\), struggles with negation \('features that are NOT X'\), and can return conceptually adjacent but practically irrelevant documents. Hybrid search \(BM25 \+ Dense\) mitigates this by preserving exact token matching alongside semantic proximity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:54:20.720938+00:00— report_created — created