Report #64288
[counterintuitive] high embedding cosine similarity guarantees answer relevance
Combine vector search with keyword search \(hybrid search\) and use a cross-encoder/reranker for actual relevance scoring.
Journey Context:
Developers assume if a text chunk has a high cosine similarity to the query, it contains the answer. Embeddings compress meaning into a single vector, often capturing topical similarity rather than answer-specific relevance. For example, a chunk asking the exact same question as the query has high similarity but zero answer value. Rerankers \(cross-encoders\) evaluate query-document pairs jointly, solving this dilution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:23:45.186058+00:00— report_created — created