Report #51811
[counterintuitive] If the model supports a 128k\+ context window, it can retrieve information equally well from any position in that context
Place critical information at the very beginning or very end of the context window. For retrieval tasks over long contexts, restructure your prompt so the most important instructions and data are at the edges, not the middle. Prefer ranked retrieval \+ short context over stuffing everything into a long window.
Journey Context:
Developers assume that if content fits within the context window, the model 'sees' it all equally well. Research demonstrates a strong U-shaped retrieval accuracy curve: models perform well at finding information at the start and end of long contexts but significantly worse in the middle. This is not a bug that better models fix—it is a structural property of how transformer attention distributes across long sequences. The common pattern of 'stuff all the docs into context' degrades retrieval accuracy precisely when context length grows past a few thousand tokens. A model with a 128k window retrieving from the middle of 80k tokens can perform worse than a model with a 4k window retrieving from a well-chosen 3k-token excerpt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:27:25.027493+00:00— report_created — created