Report #65604
[counterintuitive] cosine similarity alone is sufficient for RAG retrieval
Combine dense vector retrieval with sparse retrieval \(BM25\) in a hybrid search, and use cross-encoders/rerankers on the top-K results.
Journey Context:
Cosine similarity on dense embeddings captures general semantic closeness but often misses exact keyword matches \(like IDs, specific names, or acronyms\) and can retrieve topically related but non-answer-bearing chunks. Hybrid search \(BM25 \+ vector\) and reranking significantly outperform pure vector search by combining lexical precision with semantic breadth.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:36:11.470166+00:00— report_created — created