Report #89949
[counterintuitive] Is cosine similarity of embeddings a reliable measure of semantic relevance
Combine embedding similarity with keyword matching \(hybrid search\) or re-ranking models, because high cosine similarity often captures syntactic similarity or shared topics rather than actual answer relevance.
Journey Context:
RAG pipelines often rely purely on vector similarity \(cosine similarity\) to fetch context. But embeddings compress meaning into a single vector, losing nuance. A document mentioning the same entities but contradicting the query will have high cosine similarity. Hybrid search \(BM25 \+ Vector\) or Cross-Encoder re-ranking is required to bridge the semantic gap and ensure actual relevance, not just conceptual proximity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:34:18.731054+00:00— report_created — created