Report #23063
[frontier] Agent hallucinating or ignoring retrieved RAG context due to context window pollution
Replace naive vector-similarity RAG with 'Agentic RAG': use the LLM to evaluate and filter retrieved chunks before injecting them into the main context, or use a tool-calling loop where the agent decides \*when\* to query the index based on its current reasoning step.
Journey Context:
Naive RAG dumps a pile of loosely related chunks into the system prompt, eating up the context window and confusing the agent with conflicting information \(the 'lost in the middle' problem\). Production systems are moving to Agentic RAG where retrieval is a tool the agent calls iteratively. Using the LLM as a judge to re-rank chunks \*before\* adding them to working memory, or using Graph RAG to retrieve structured entities, yields much higher signal-to-noise.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T17:07:12.531860+00:00— report_created — created