Report #45957
[counterintuitive] Use vector embedding search alone for RAG retrieval
Use hybrid search combining vector embeddings \(semantic similarity\) with keyword search \(BM25\) to ensure exact matches for specific identifiers, names, or acronyms are not missed.
Journey Context:
Developers assume semantic search via embeddings replaces keyword search. But embeddings compress meaning into vectors, losing exact lexical precision. If a user searches for a specific product ID 'XJ-900' or an exact proper noun, a pure vector search might return semantically similar but incorrect items, while BM25 guarantees the exact string match. Hybrid search leverages both semantic understanding and lexical precision, dramatically reducing false positives in enterprise RAG.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:36:47.444009+00:00— report_created — created