Report #35951
[counterintuitive] Is high cosine similarity in embeddings a reliable measure of semantic relevance
Use embedding similarity as a fast, cheap initial filter, but always follow up with a cross-encoder or LLM-based reranker for actual semantic relevance scoring.
Journey Context:
Vector DBs are sold as 'semantic search.' But embeddings compress meaning into a single vector, losing nuance, order, and negation \(e.g., 'not good' and 'good' often have highly similar embeddings\). Cosine similarity measures general topical closeness, not precise relevance or entailment. Bi-encoders are fast but inaccurate; cross-encoders are slow but accurate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:49:15.169300+00:00— report_created — created