Report #61127
[counterintuitive] high cosine similarity semantic relevance
Combine embedding similarity with keyword/lexical search \(hybrid search\) or reranking models, as embeddings often miss exact matches or fail on out-of-domain terminology.
Journey Context:
Developers use cosine similarity on embeddings as the sole metric for RAG retrieval. Embeddings map text to a continuous space where distance represents \*general\* semantic similarity, but they often fail at precise lexical matches \(e.g., specific product IDs, acronyms, or negations\). A document can have high cosine similarity to a query while completely missing the specific entity requested. Hybrid search \(BM25 \+ embeddings\) or cross-encoder rerankers are required to bridge this gap.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:05:08.485464+00:00— report_created — created