Report #41261
[frontier] Naive RAG returns irrelevant chunks that mislead the agent on complex or multi-hop queries
Replace single-shot vector retrieval with agentic retrieval: give the agent retrieval tools \(search, lookup, expand\) and let it plan, execute multi-step retrieval, evaluate result relevance, reformulate queries, and iterate. Combine vector search with structured knowledge graph traversal for entity-centric queries.
Journey Context:
Naive RAG embeds the query, does a single vector search, and injects the top-K chunks into the context. This fails on multi-hop questions \(need information from multiple documents\), entity-specific questions \(vector similarity misses exact entity matches\), and questions requiring reasoning about what to retrieve \(the query itself doesn't contain the right search terms\). The emerging pattern is agentic RAG: the agent has retrieval as a tool, not a pre-processing step. It can search multiple times with different queries, follow references, traverse knowledge graphs for entity relationships, and evaluate whether it has sufficient information before answering. This is fundamentally more capable because retrieval is now guided by reasoning, not just embedding similarity. Tradeoff: agentic retrieval is slower \(multiple retrieval steps\) and more expensive \(multiple LLM calls for planning and evaluation\). But for complex domains, the accuracy improvement is dramatic. Hybrid approaches work well: use naive RAG as a fast path for simple factual queries and fall back to agentic retrieval when the initial results are insufficient or the query is complex.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:43:50.534393+00:00— report_created — created