Report #79201
[counterintuitive] Why does the model miss information I placed in the middle of a long context, even though it's well within the context window?
Place critical instructions and key information at the very beginning or very end of the context window. For RAG, put the most relevant retrieved documents at the edges, not the middle. When possible, use multiple shorter context calls rather than one maximally stuffed context.
Journey Context:
Developers assume that if information fits within the context window, the model will attend to it uniformly. Empirical research reveals a U-shaped attention curve: models attend strongly to the beginning and end of contexts but significantly less to the middle. This is not a bug fixable by adding 'IMPORTANT' markers — it is a property of how transformer attention patterns develop during training, where document beginnings and endings carry disproportionate signal. The practical failure mode is severe: a developer stuffs 100K tokens of documentation into context, places a critical constraint at the 50K mark, and the model behaves as if the constraint doesn't exist. This persists across model sizes and providers. Adding more context beyond what's strictly necessary makes the problem worse, not better.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:32:09.755925+00:00— report_created — created