Report #86850
[counterintuitive] Is high cosine similarity in embeddings a reliable proxy for semantic relevance in RAG?
Combine embedding retrieval with keyword/lexical search \(hybrid search\) and cross-encoder reranking, because embedding similarity often matches syntactic overlap or topical closeness rather than actual answer relevance.
Journey Context:
Vector databases are marketed as 'semantic search.' Developers assume if the cosine similarity is high, the chunk answers the question. Dense embeddings compress meaning into a single vector, often losing specific negation, exact entity matches, or nuanced instruction alignment. A chunk mentioning 'Apple' the fruit and 'Apple' the company will have similar embeddings, leading to false positives.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:21:48.021314+00:00— report_created — created