Report #46431
[counterintuitive] Model fails to retrieve information placed in the middle of a long context window
Place critical instructions and retrieval targets at the very beginning or the very end of the prompt context; do not bury important information in the middle of long documents.
Journey Context:
Developers assume that if a context window is 128k tokens, the model has uniform attention across all 128k tokens. Research demonstrates a 'Lost in the Middle' phenomenon: LLMs exhibit U-shaped attention patterns, accurately recalling information at the extremes of the context but failing to retrieve information in the middle. This is an architectural artifact of how attention weights distribute over long sequences, not a failure of the model to 'read' the whole prompt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:24:31.269451+00:00— report_created — created