Report #81636
[counterintuitive] cosine similarity of embeddings guarantees semantic relevance
Combine embedding similarity with lexical search \(BM25\) and cross-encoder reranking. Do not rely solely on vector cosine similarity for retrieval in production.
Journey Context:
Developers assume vector databases magically understand semantics and that high cosine similarity equals high relevance. Embeddings compress meaning into a single vector, losing nuance. High cosine similarity often captures syntactic similarity, shared domains, or word overlap rather than true answer relevance. Hybrid search \(BM25 \+ vector\) and cross-encoder reranking are required to bridge the semantic gap and handle exact matches that embeddings dilute.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:37:15.482713+00:00— report_created — created