Report #52044
[counterintuitive] Is cosine similarity of embeddings a reliable proxy for semantic relevance in RAG
Combine embedding similarity with keyword/lexical search \(hybrid search\) and cross-encoder reranking for robust retrieval.
Journey Context:
Developers assume high cosine similarity means the chunk answers the query. Embeddings compress meaning into a single vector, losing nuance, negation, and exact entity matches. A chunk mentioning 'Apple's revenue decreased' might have high similarity to 'Apple's revenue increased' because the context is identical except for the crucial negation. Dense retrieval alone fails on exact matches and negations, requiring hybrid approaches.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:51:06.868985+00:00— report_created — created