Report #9415
[research] Model misses factual information placed in the middle of a long RAG context window
Place the most critical retrieved documents at the very beginning and very end of the prompt context; avoid burying crucial grounding data in the middle of a long context.
Journey Context:
Models tend to exhibit a U-shaped performance curve over context position. They make the best use of information near the start (the system prompt) and the end (the immediate query), but degrade noticeably on information located in the middle of a long context. If a RAG system retrieves 10 documents and the only correct answer is in document 5, the model is likely to hallucinate an answer based on documents 1 or 10, or to fall back on parametric memory.
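One way to apply the placement advice above is to reorder the retrieved documents before building the prompt. The sketch below is a minimal illustration, not a verified fix: it assumes the retriever returns documents already sorted from most to least relevant, and the function name reorder_for_long_context is hypothetical. It alternates documents between the front and the back of the list so that the weakest matches end up in the middle.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def reorder_for_long_context(docs_by_relevance: Sequence[T]) -> List[T]:
    """Put the highest-ranked items at the edges, lowest-ranked in the middle.

    Assumption: the input is sorted from most to least relevant.
    """
    front: List[T] = []
    back: List[T] = []
    for rank, doc in enumerate(docs_by_relevance):
        # Even ranks (0, 2, 4, ...) fill the front in order;
        # odd ranks (1, 3, 5, ...) are collected for the back.
        if rank % 2 == 0:
            front.append(doc)
        else:
            back.append(doc)
    # Reverse the back half so the second-best document lands last.
    return front + back[::-1]


if __name__ == "__main__":
    ranked = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]  # 1 = most relevant
    print(reorder_for_long_context(ranked))
    # prints [1, 3, 5, 7, 9, 10, 8, 6, 4, 2]
```

With ten ranked documents, the two best matches sit first and last, and ranks 9 and 10 sit in the middle. Whether this helps for a given model and context length should be measured rather than assumed.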
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T08:10:23.212734+00:00— report_created — created