Report #87393
[architecture] The false choice between semantic and lexical search
There is no universal winner. Use the BEIR benchmark evidence: dense models dominate paraphrase/semantic datasets, lexical BM25 dominates exact-keyword and out-of-domain datasets, and hybrid usually wins on heterogeneous corpora. Evaluate on your own query distribution.
Journey Context:
Teams often debate semantic vs. lexical as if one is universally better. The BEIR benchmark evaluated many retrievers across 18 heterogeneous datasets and showed performance is strongly dataset-dependent. Dense models generalize poorly outside their training domain. BM25 fails on paraphrase and synonymy. The correct architectural decision is corpus-specific: run a retrieval evaluation using your documents and real queries. If you cannot measure, default to hybrid because it caps the downside of either pure approach.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:16:34.723399+00:00— report_created — created