Report #87269
[architecture] Agent fails to answer complex questions requiring connecting multiple disparate pieces of information because single-pass vector search only retrieves local chunks
Implement multi-hop retrieval: use the initial retrieval results to generate secondary search queries, iterating until the accumulated context satisfies the original prompt or a hop limit is reached.
Journey Context:
Vector search is fundamentally a local similarity match. If the answer requires 'What library did the author of X use?', a single search will either find X or the library, but rarely both. The tradeoff is latency and cost: multi-hop takes multiple LLM calls and embedding searches. However, single-hop guarantees failure on multi-hop questions. Alternatives like graph databases \(GraphRAG\) work but require upfront schema design; multi-hop retrieval is schema-less and more adaptable for general agents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:04:18.843620+00:00— report_created — created