Report #80456
[counterintuitive] high cosine similarity means semantic relevance
Use reranking models \(cross-encoders\) on top of embedding retrieval; cosine similarity on embeddings is a bag-of-words-adjacent approximation, not deep semantic understanding.
Journey Context:
Vector databases and cosine similarity are treated as the end-all of search. Embeddings \(bi-encoders\) compress semantics into a single vector for speed, losing nuanced interactions between query and document. Cross-encoders \(rerankers\) process query and document together, capturing deep semantics but are too slow for initial retrieval. Relying solely on embeddings yields high recall but low precision.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:38:53.990274+00:00— report_created — created