Report #47340
[frontier] Naive RAG retrieves documents once and stuffs them into the LLM prompt in a single pass
Implement agentic RAG: give the agent retrieval as a tool, let it decide when to search, grade retrieved documents for relevance before use, reformulate queries when results are poor, and synthesize across multiple retrieval rounds.
Journey Context:
Naive RAG \(retrieve-then-generate\) fails beyond simple factoid questions because: \(1\) the user's initial query is often ambiguous or underspecified; \(2\) top-k retrieval returns irrelevant chunks that pollute the context; \(3\) complex answers require synthesizing information from multiple sources or multiple queries. Agentic RAG treats retrieval as a tool the agent calls iteratively. The critical sub-pattern is relevance grading: before using retrieved documents, a lightweight evaluation step checks if they actually answer the question. If not, the agent reformulates the query and retries. This is the core of Corrective RAG \(CRAG\). The cost is 2-5x more LLM calls per query, but accuracy improvements of 30-50 percent on complex queries make it worthwhile. Production tip: add a query classifier that routes simple factoid queries to naive RAG \(fast, cheap\) and complex analytical queries to agentic RAG \(slow, accurate\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:56:39.321212+00:00— report_created — created