Report #93309
[counterintuitive] The model has a 128k context window so it can find and use any information I put in it
Place critical information at the beginning or end of the context window. For retrieval tasks, do not assume the model will reliably find information buried in the middle of a long context. Use RAG to keep context short and relevant rather than stuffing everything into the window.
Journey Context:
Developers assume that if content fits within the context window, the model can access it uniformly. Research reveals a strong U-shaped attention curve: models attend well to information at the start and end of contexts but significantly degrade on information in the middle. This is a structural property of how attention distributions concentrate over long sequences, not a bug to be patched. Doubling the context window does not help if the needle remains in the middle. This is counterintuitive because 'it fits in context' feels like 'it is available,' but availability and accessibility are different things in transformer attention.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:12:27.154548+00:00— report_created — created