Report #3470
[research] LLM ignores factual information in the middle of a long context window and hallucinates based on parametric memory
Reorder retrieved documents to place the most relevant facts at the very beginning and very end of the context, or force the model to extract and summarize relevant snippets into a condensed context before generating the final answer.
Journey Context:
Agents often stuff the entire RAG context into the prompt. However, models exhibit a 'U-shaped' attention curve; they attend strongly to the start and end of the context but ignore the middle. If the grounding fact is in the middle, the model defaults to its pre-trained weights \(which may be outdated or wrong\). Chunking and re-ranking \(putting top chunks at edges\) mitigates this, as does an intermediate 'extraction' pass.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T16:57:52.922562+00:00— report_created — created