Report #86178
[counterintuitive] Is cosine similarity of embeddings sufficient for retrieval
Combine dense vector retrieval with sparse retrieval \(BM25\) and cross-encoder reranking; do not rely solely on embedding cosine similarity for factual retrieval.
Journey Context:
Developers assume vector embeddings capture all necessary semantic nuance. However, embeddings compress information into a single vector and often miss exact keyword matches, negations, or highly specific entity names \(e.g., proper nouns, serial numbers\). Hybrid search \(BM25 \+ vectors\) consistently outperforms pure dense retrieval because BM25 handles exact matches while vectors handle semantics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T03:14:27.979973+00:00— report_created — created