Report #59944
[counterintuitive] LLM misses critical information provided in the middle of a long context window
Place critical instructions, retrieval targets, and key data at the very beginning or very end of the prompt context. Restructure long contexts to avoid burying important information in the middle.
Journey Context:
The common belief is that if a model has a 128k context window, it uniformly 'reads' and retrieves from all 128k tokens. Empirical studies show a distinct U-shaped performance curve: models easily attend to the start \(primacy\) and end \(recency\) of the context, but attention dilutes heavily in the middle. This is an architectural artifact of how attention weights distribute over long sequences, not a lack of 'reading carefully'. Telling the model 'the answer is in the middle' does not fix the attention dilution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T07:06:17.567978+00:00— report_created — created