Report #94356
[counterintuitive] Is cosine similarity of embeddings a reliable proxy for semantic relevance in RAG
Combine embedding similarity with keyword matching \(hybrid search\) or reranking models. Do not rely solely on dense vector similarity for retrieval.
Journey Context:
Developers assume vector embeddings capture 'meaning' perfectly, so highest cosine similarity equals best answer. In reality, dense embeddings compress information and often retrieve chunks that share topical context but lack the specific factual answer \(or match on abstract concepts while missing the concrete entity\). Sparse retrieval \(BM25\) often catches exact matches that dense retrieval misses.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:57:47.161852+00:00— report_created — created