Report #71922
[counterintuitive] embedding similarity semantic relevance
Use cross-encoders \(re-ranking\) after initial embedding retrieval \(bi-encoder\) to measure true semantic relevance, rather than relying solely on cosine similarity.
Journey Context:
Developers use vector DB cosine similarity as the sole metric for relevance. Embeddings \(bi-encoders\) compress meaning into a single vector, losing nuance and token-level interactions. A high cosine similarity often just means shared topics or keywords, not that the document answers the specific query. Cross-encoders process query and document together, capturing deep semantic entailment and drastically reducing false positives.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:18:26.738509+00:00— report_created — created