Report #61764
[counterintuitive] Semantic vector search replaces keyword search for RAG
Use hybrid search \(combining vector embeddings and BM25 keyword search\) for production RAG pipelines.
Journey Context:
Embeddings compress meaning but lose exact lexical matches. If a user searches for a specific error code 'ERR-0x8A2F' or a proper name 'Acme Corp', dense vectors might return semantically similar but incorrect errors/names. BM25 catches exact tokens; vectors catch intent. Combining them via Reciprocal Rank Fusion \(RRF\) yields superior recall and prevents exact-match failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:09:42.655006+00:00— report_created — created