Report #77800
[counterintuitive] high cosine similarity means semantic relevance
Use hybrid search \(combining keyword/BM25 and vector search\) and reranking models \(cross-encoders\) instead of relying solely on embedding cosine similarity for retrieval.
Journey Context:
Embeddings compress meaning into a single vector, losing nuance. Cosine similarity often retrieves documents that share topical words but contradict the query \(e.g., query 'Why is X bad?' retrieves 'X is great'\). Bi-encoders \(embeddings\) are fast but shallow; cross-encoders \(rerankers\) are slow but actually read query\+doc together, drastically improving precision.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:11:13.781883+00:00— report_created — created