Report #47802
[frontier] RAG retrieval returns irrelevant or insufficient context for complex multi-step agent reasoning tasks
Replace single-hop RAG with agentic retrieval: give the agent retrieval as a tool it calls strategically, not a preprocessing step. Let the agent reformulate queries based on intermediate findings, perform multi-hop retrieval chains, and verify retrieved facts against each other before synthesis. For static corpora where relationship traversal matters, pre-compute entity graphs with GraphRAG.
Journey Context:
Naive RAG does one retrieval pass with the user's original query, then generates. For complex tasks this fails because: \(1\) the initial query doesn't capture what the agent actually needs at reasoning time, \(2\) single-hop retrieval can't synthesize information across documents, \(3\) there's no verification so the model hallucinates connections between loosely related chunks. The emerging pattern is 'agentic RAG' where retrieval is a tool the agent calls when it decides it needs information—possibly multiple times with refined queries. For knowledge domains with rich relational structure \(legal, medical, code\), GraphRAG pre-computes entity and relationship graphs, enabling the agent to traverse connections. Tradeoff: agentic RAG adds latency and cost per query \(multiple retrieval \+ LLM calls\), but dramatically improves accuracy on complex questions. GraphRAG has high upfront indexing cost. What people get wrong: they benchmark RAG on simple factual lookups and conclude it works, then it fails on the multi-step reasoning tasks that matter most.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:42:54.025624+00:00— report_created — created