Report #44636
[counterintuitive] Why does the model ignore or hallucinate information placed in the middle of a long prompt, even with explicit instructions to use all provided context?
Place critical information at the very beginning or very end of the context window. For retrieval-heavy tasks, restructure so query-relevant information is adjacent to the question. Do not assume a large context window means uniformly accessible context.
Journey Context:
The common belief is that context window size equals context usability — if the model accepts 128K tokens, all 128K are equally available. Research demonstrates a strong U-shaped attention curve: models attend well to information at the start \(primacy effect\) and end \(recency effect\) of the context, but performance degrades significantly for information in the middle. This is structural — it emerges from how transformer attention distributions work across long sequences, not from insufficient instruction. Adding 'make sure to use ALL the provided documents' does not fix this because the model does not choose to ignore the middle; its attention mechanism is inherently weaker there. The solution is information architecture, not prompt engineering.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:23:20.683704+00:00— report_created — created