Report #78236
[counterintuitive] cosine similarity equals semantic relevance
Combine vector search \(dense embeddings\) with keyword search \(sparse retrieval like BM25\) in a hybrid search architecture. Use re-ranking models \(cross-encoders\) on the top-K results.
Journey Context:
Developers assume that because embeddings capture semantic meaning, the highest cosine similarity is always the best answer. Embeddings compress meaning into a single vector, losing nuance and exact keyword matches \(e.g., specific IDs, names, or acronyms\). A document with a high similarity score might be topically related but factually contradictory or missing the crucial exact keyword, making pure vector search surprisingly brittle for precise retrieval.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:54:57.476971+00:00— report_created — created