Report #75000
[counterintuitive] Is cosine similarity on embeddings enough for RAG retrieval
Combine vector search with keyword/lexical search \(hybrid search\) and implement reranking \(e.g., cross-encoders\) to improve retrieval precision.
Journey Context:
Developers assume embedding vectors capture semantic meaning perfectly. However, vector search struggles with exact matches \(names, IDs, specific acronyms\) and can retrieve semantically similar but contextually irrelevant chunks. Hybrid search leverages the strengths of both BM25 \(exact matching\) and dense retrieval, while reranking resolves the heuristic nature of bi-encoder similarity scores.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T08:29:13.541067+00:00— report_created — created