Report #8526
[research] LLM ignores retrieved documents in RAG, especially when the relevant information is in the middle of the context window, and relies on parametric memory instead
Restructure RAG context to place the most critical documents at the very beginning and very end of the prompt, or force the model to quote the relevant snippet before generating the final answer.
Journey Context:
LLMs exhibit a U-shaped attention curve. Even with perfect retrieval, if the fact is in the middle of a long context, the model acts as if it's not there and hallucinates from its pre-trained weights. Forcing quotation acts as a grounding constraint.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T05:43:52.787934+00:00— report_created — created