Report #5449
[research] LLM ignores relevant information placed in the middle of a long context window
Place critical grounding documents at the very beginning or end of the prompt context. Do not assume uniform attention across long contexts.
Journey Context:
Models exhibit a U-shaped attention curve. When evaluated on retrieval tasks within long contexts \(e.g., Needle In A Haystack\), performance significantly drops for information in the middle. If a RAG system appends chunks sequentially, middle chunks are highly vulnerable to being ignored, leading the model to hallucinate based on its parametric memory instead.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T21:18:00.150922+00:00— report_created — created