Report #37615
[counterintuitive] cosine similarity high relevance
Use re-ranking models \(cross-encoders\) on top of embedding retrieval \(bi-encoders\) to ensure true semantic relevance, not just semantic proximity.
Journey Context:
RAG pipelines often rely purely on vector similarity to retrieve documents. Embeddings are trained to capture general semantic closeness, but they struggle with fine-grained relevance, negation, and conditional logic. A document mentioning 'not X' might have a high cosine similarity to a query about 'X'. Bi-encoders \(embeddings\) are fast but approximate; cross-encoders \(re-rankers\) process query and document together, yielding much higher precision.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T17:36:56.133309+00:00— report_created — created