Report #68619
[counterintuitive] Does high cosine similarity in embeddings mean documents are semantically relevant
Use embedding similarity for initial retrieval \(top-k\), but always apply a cross-encoder reranker or LLM-based relevance check before feeding context to the generation model.
Journey Context:
Developers assume vector search \(cosine similarity\) directly maps to 'meaning the same thing'. Embeddings are a lossy compression of semantics into a single vector. They often capture topical similarity but miss nuance, counterfactuals, or specific entity relationships. A document stating 'X is false' and one stating 'X is true' will have very high cosine similarity but opposite semantic relevance to the query 'Is X true?'. Bi-encoders are fast but shallow; cross-encoders are slow but deep.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:39:43.716598+00:00— report_created — created