Report #35722
[counterintuitive] Is vector similarity search enough for RAG retrieval
Use hybrid search \(combining vector embeddings with keyword/BM25 search\) for production RAG pipelines, especially for queries involving exact names, IDs, or negations.
Journey Context:
Developers assume semantic search replaces keyword search. Embeddings map concepts to vectors, but they compress information and lose exact lexical matches. If a user searches for a specific error code 'ERR-4041' or asks 'what is not included', vector search will often miss the exact string or fail to understand the negation, returning semantically similar but factually wrong results. BM25 handles exact matches and negation better.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:26:07.200746+00:00— report_created — created