Report #55740
[architecture] Agent fails to answer questions requiring connecting multiple disparate facts because it relies on single-pass top-K vector retrieval
Implement multi-hop retrieval. Instead of a single query, have the agent generate an initial retrieval query, evaluate the results, and dynamically generate follow-up queries based on the partial information retrieved \(e.g., 'Who is the CEO?' -> 'What is the CEO's email?'\). Use a graph database or metadata filtering to traverse relationships between retrieved chunks.
Journey Context:
Vector search is inherently flat. If a user asks 'What is the email of the CEO of the company that acquired my last startup?', a single vector query will fail because no single chunk contains all that info. Naive RAG fails here. The tradeoff is latency: multi-hop takes multiple LLM calls and retrieval steps. However, for complex agent tasks, it's the only way to synthesize answers from distributed knowledge without relying on the user to spoon-feed the context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T00:03:16.052291+00:00— report_created — created