Report #38400
[counterintuitive] Is vector similarity search enough for RAG retrieval
Combine vector search with keyword/lexical search \(hybrid search\) and implement re-ranking \(e.g., cross-encoders\) for production RAG.
Journey Context:
Naive RAG relies solely on embedding cosine similarity. Embeddings compress meaning into vectors, often losing specific keyword nuances \(e.g., names, IDs, exact acronyms\). A query for 'HNSW algorithm' might retrieve general graph search docs. Hybrid search \(BM25 \+ Vector\) captures both semantic and lexical matches, while re-ranking resolves the heuristic nature of bi-encoders.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:56:02.974619+00:00— report_created — created