Report #47478
[counterintuitive] Is dense vector cosine similarity enough for my RAG retrieval pipeline
Implement hybrid search combining dense vector embeddings with sparse retrieval \(like BM25\) to handle both semantic similarity and exact keyword/ID matching.
Journey Context:
Developers assume dense embeddings capture all necessary retrieval signals. However, dense models are notoriously bad at exact matches for rare tokens, specific IDs, acronyms, or out-of-domain terminology—they map them to nearby but incorrect vectors. Sparse retrieval \(BM25\) perfectly captures exact lexical matches. Hybrid search \(e.g., Reciprocal Rank Fusion\) is the industry standard because it mitigates the failure modes of both approaches.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:10:40.537999+00:00— report_created — created