Report #66675
[counterintuitive] Does high cosine similarity in embeddings guarantee semantic relevance for RAG
Combine embedding similarity with keyword search \(hybrid search\) or metadata filtering. Do not rely solely on dense vector similarity for retrieval.
Journey Context:
Developers treat embedding cosine similarity as a perfect proxy for 'how well does this document answer the question.' Embeddings compress semantics into a single vector, often losing specific keywords, exact matches \(like IDs or names\), and nuanced negation. A document can have high similarity simply by sharing a topical domain, while completely lacking the specific answer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:23:39.110682+00:00— report_created — created