Report #94988
[counterintuitive] Semantic vector search replaces keyword search for RAG
Use hybrid search \(combining vector embeddings and traditional keyword/BM25 search\) to ensure exact matches for names, IDs, and acronyms are not missed.
Journey Context:
Developers replace their entire search stack with vector databases assuming semantic understanding covers everything. However, embedding models compress text into generalized semantic spaces, destroying exact lexical matches. If a user searches for a specific error code 'ERR-4021' or a proper name 'Zephyr', a pure vector search might return semantically similar but incorrect errors or names. BM25 excels at exact token matching.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:01:06.235535+00:00— report_created — created