Report #46405
[counterintuitive] Cosine similarity on dense embeddings is all you need for high-quality RAG retrieval
Combine dense vector search with sparse retrieval (BM25) in a hybrid search architecture, then rerank the merged candidates with a cross-encoder.
Journey Context:
Developers often default to a pure vector database for RAG. Dense embeddings capture semantic similarity well but are unreliable for exact keyword matches such as IDs or acronyms. A search for 'HNSW', for example, might return semantically related documents yet miss the paper whose title contains that exact term. Hybrid search (BM25 plus dense) captures both semantic meaning and exact lexical matches.
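A minimal sketch of the hybrid idea, using only the Python standard library. Several things here are assumptions and do not come from the report: the toy corpus, the whitespace tokenizer, the Lucene-style non-negative IDF variant of BM25, the hard-coded `dense_ranking` (a stand-in for whatever a real embedding index would return), and the use of reciprocal rank fusion (RRF) with `k=60` to merge the two rankings. Other fusion schemes, such as weighted score sums, would work as well.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score every doc against the query with Okapi BM25.

    Tokenization is a naive lowercase whitespace split (assumption).
    """
    tokenized = [d.lower().split() for d in docs]
    n = len(tokenized)
    avgdl = sum(len(t) for t in tokenized) / n
    # Document frequency: in how many docs each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            # Non-negative IDF variant (assumption, as used by Lucene).
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(tokens) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def rank(scores):
    """Return doc indices ordered by descending score."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


def rrf(rankings, k=60):
    """Reciprocal rank fusion: each ranking adds 1 / (k + position)."""
    fused = Counter()
    for ranking in rankings:
        for pos, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + pos)
    return [doc_id for doc_id, _ in fused.most_common()]


# Toy corpus (hypothetical); only doc 1 contains the literal token "hnsw".
docs = [
    "graph based approximate nearest neighbor search",
    "hnsw index parameters and tuning notes",
    "vector quantization for similarity search",
]

sparse_ranking = rank(bm25_scores("hnsw", docs))

# Hypothetical dense result: semantic neighbors first, exact match last.
dense_ranking = [0, 2, 1]

fused = rrf([sparse_ranking, dense_ranking])
print("sparse:", sparse_ranking, "dense:", dense_ranking, "fused:", fused)
```

In this toy setup BM25 puts the exact-match document first, and fusion lifts it above the position the dense ranking alone gave it. In a fuller pipeline, the top fused candidates would then be passed to a cross-encoder for reranking; that step is omitted here because it depends on the model and library in use.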
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:21:53.817238+00:00— report_created — created