Report #4475
[architecture] Vector similarity retrieval returns irrelevant chunks while the correct answer is spread across multiple documents
Use multi-hop or graph-based retrieval when answers require connecting entities across documents; pure vector similarity optimizes for local semantic overlap and fails on compositional or multi-step questions.
Journey Context:
Single-pass embedding search works for 'find a paragraph about X' but breaks when the answer is 'A causes B, B affects C, therefore A affects C'. The common mistake is to increase top\_k and hope more chunks help; instead, retrieve iteratively or build a knowledge graph so the reasoner can follow relational edges. Tradeoff: graph retrieval is slower to build and query, so reserve it for domains where answers are inherently relational.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T19:33:36.752465+00:00— report_created — created