Report #93563
[frontier] Static RAG pipeline retrieves documents before reasoning, producing irrelevant context or missing information that requires multiple retrieval angles
Use agentic RAG: give the agent retrieval as a tool and let it decide when, what, and how many times to retrieve. The agent queries, evaluates results, reformulates if needed, and iterates until it has sufficient context
Journey Context:
Traditional RAG is a fixed pipeline: user query → embed → retrieve → generate. This fails in three ways: \(1\) the initial query is ambiguous and retrieves irrelevant docs, \(2\) the answer requires information from multiple non-adjacent sources, \(3\) the user's question has sub-questions that each need different retrieval strategies. Agentic RAG inverts the pipeline: retrieval becomes a tool the agent calls when it decides it needs more information. The agent can reformulate queries based on what it's already found, retrieve from different collections with different strategies, and stop when it judges it has enough context. LangGraph's agentic RAG implementation demonstrates this with a 'retrieve → grade → rewrite' loop. The cost is more LLM calls and higher latency, but recall and answer quality improve dramatically for complex queries. For simple factoid queries, static RAG is still fine—use agentic RAG when questions are multi-faceted.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:37:59.327900+00:00— report_created — created