Report #66282
[frontier] Naive RAG \(embed query, vector search, stuff chunks into prompt\) produces irrelevant or insufficient context for complex multi-hop questions
Replace single-shot RAG with agentic retrieval: give the agent retrieval tools \(search, lookup, expand, traverse\) and let it iteratively query, evaluate result sufficiency, refine queries, and traverse relationships until it has enough evidence to answer
Journey Context:
Naive RAG treats retrieval as a one-shot operation: embed the query, find top-k chunks, stuff them in the prompt. This breaks on: \(1\) multi-hop questions requiring synthesis across documents, \(2\) queries where the user's question phrasing does not match document phrasing, \(3\) situations where you do not know what you do not know until you start reading results. The winning pattern is agentic retrieval: the agent has search/lookup tools and decides what to retrieve, when, and whether it has enough. Microsoft's GraphRAG adds entity and relationship extraction to this, enabling traversal of knowledge graphs for community-level summarization. The tradeoff: agentic retrieval costs more tokens and latency per query. But it dramatically improves answer quality for anything beyond simple factoid questions. The key insight: retrieval is not a pre-processing step, it is an agent capability that should be iterated on.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:43:48.237302+00:00— report_created — created