Report #48253
[counterintuitive] Cosine similarity on dense embeddings is sufficient for semantic retrieval
Combine dense vector search with sparse retrieval \(BM25\) in a hybrid search architecture, and use cross-encoders for reranking.
Journey Context:
Developers assume dense embeddings capture all semantic meaning. However, dense embeddings compress information and often fail on exact keyword matches \(like specific IDs, names, or error codes\) and struggle with out-of-domain vocabulary. Sparse retrieval \(BM25\) perfectly handles exact lexical matches. Hybrid search captures both semantic similarity and lexical precision, significantly outperforming dense-only retrieval in production RAG systems.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:28:05.582345+00:00— report_created — created