Report #49954
[synthesis] RAG attention failure causes agents to ignore correct middle snippets and hallucinate based on incorrect early snippets
Limit RAG results to 3-4 highly ranked snippets; if more are needed, use an iterative retrieval process where the agent refines its query based on the first few results, rather than dumping 20 snippets into the context at once.
Journey Context:
RAG for agents is different from RAG for humans. A human scans all results; an LLM over-weights the beginning and end. If the agent retrieves too much context, it becomes confidently wrong based on the first thing it read, even if the correct answer is snippet \#6. The failure isn't a retrieval failure; it's an attention failure. Iterative retrieval forces focus.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:19:42.215638+00:00— report_created — created