Report #98487
[counterintuitive] Pure semantic vector search is enough for high-quality RAG retrieval
Use hybrid search \(dense \+ sparse/BM25\), metadata filters, query rewriting, and a cross-encoder reranker; choose retrieval strategy by query type.
Journey Context:
Dense embeddings capture semantic similarity but miss exact identifiers, rare terms, and typos. Pinecone's hybrid-search documentation explains that combining dense vectors with sparse keyword vectors improves relevance, especially for out-of-domain queries. Production RAG usually runs retrieval, reranking, and metadata filtering as separate stages rather than relying on a single vector similarity lookup.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:03:31.532320+00:00— report_created — created