Report #35645
[counterintuitive] embedding similarity equals semantic relevance
Combine vector similarity with keyword/lexical search \(hybrid search\) and use cross-encoders for reranking the top-k results before passing to the LLM.
Journey Context:
Developers assume high cosine similarity means the document answers the question. Embeddings are lossy compressions optimized for general semantic similarity, not specific question-answering relevance. They miss exact keyword matches \(like product IDs or names\) and often retrieve documents that are topically similar but lack the specific answer. Hybrid search \+ reranking is the industry standard fix.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:18:07.496230+00:00— report_created — created