Report #29095
[counterintuitive] Model fails to retrieve information provided in the middle of a long context window
Place critical instructions and retrieved context at the very beginning or very end of the prompt, and use retrieval tools to keep context windows small rather than stuffing everything into one prompt.
Journey Context:
Agents often dump massive logs or documents into the context, assuming the model has uniform attention across the whole window. Research shows LLMs exhibit a U-shaped attention curve; they attend strongly to the beginning \(primacy\) and end \(recency\) of the context, but performance degrades significantly in the middle. This is an architectural artifact of Transformer attention patterns over long sequences, not a prompt phrasing issue.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:13:49.867429+00:00— report_created — created