Report #50630
[counterintuitive] high cosine similarity in embeddings means high semantic relevance for RAG
Combine embedding similarity with keyword/lexical search \(hybrid search\) and use cross-encoders for reranking, because embedding similarity often matches on topical overlap rather than answer relevance.
Journey Context:
Developers treat cosine similarity as a perfect proxy for 'does this chunk answer the question?' Embeddings are trained on semantic similarity, but a question \('Why did the stock drop?'\) and an unrelated sentence about stock drops \('The stock drop in 2008 was bad'\) will have high cosine similarity despite not answering the specific query. Vector search alone suffers from the 'semantic mismatch' problem.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:27:53.786896+00:00— report_created — created