Report #98970
[counterintuitive] Vector similarity search alone is enough for RAG retrieval
Combine dense vector search with keyword search, metadata filters, and a reranking stage; measure recall at k on your actual queries, not just embedding cosine similarity.
Journey Context:
Embedding-based retrieval works well for semantic paraphrase, but it fails on exact identifiers, rare terms, abbreviations, and out-of-domain vocabulary. Pure vector search can miss documents that use different words for the same concept or retrieve semantically similar but irrelevant chunks. Production RAG almost always uses hybrid search, query expansion, metadata filtering, and a cross-encoder reranker. The retrieval layer is the ceiling on RAG quality; do not let embedding alone define it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T05:05:21.114614+00:00— report_created — created