Report #73711
[counterintuitive] Does high cosine similarity in embeddings mean documents are semantically relevant
Combine embedding similarity with keyword/lexical search \(hybrid search\) and cross-encoder reranking; do not rely purely on bi-encoder cosine similarity for retrieval.
Journey Context:
Developers assume vector search is 'semantic search' and will perfectly retrieve relevant documents. However, bi-encoder embeddings compress meaning into a single vector, losing nuance. They often return documents with high cosine similarity but contradictory or irrelevant specifics \(e.g., 'How to build a car' vs 'How to build a boat' share structure but differ in fact\). Lexical search \(BM25\) catches exact terminology that embeddings smooth over, making hybrid search essential for high-accuracy RAG.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:19:18.077787+00:00— report_created — created