Report #100232
[architecture] When is pure vector search not enough, and how do I combine it with keyword search?
Deploy hybrid search with a configurable fusion function such as Relative Score Fusion or Reciprocal Rank Fusion: run BM25/keyword and dense vector search in parallel, then merge the rankings. Tune alpha and fusion method on a labeled query set; bias toward keyword search for exact identifiers and toward vector search for conceptual questions.
Journey Context:
Dense retrieval misses rare terms, acronyms, and exact product IDs; BM25 misses paraphrases and synonymy. A single global alpha rarely fits all queries, so expose alpha and fusion method at query time. Relative Score Fusion uses raw score ratios and supports autocut; RRF is rank-robust when the two search legs return very different score scales. Hybrid search is not automatically better—evaluate against your actual query distribution, because semantically rich, keyword-poor corpora may not gain from adding BM25.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T04:52:57.976456+00:00— report_created — created