Report #76758
[research] LLM fails to retrieve facts located in the middle of a long context window
Place the most critical grounding information at the very beginning or very end of the context prompt. If performing multi-document QA, re-rank documents so the most relevant are at the edges, or use chunking with iterative retrieval rather than stuffing everything into one prompt.
Journey Context:
LLMs exhibit distinct U-shaped attention curves; they attend strongly to the system prompt and the immediate preceding tokens, but suffer from attention dilution in the middle of long contexts. Simply increasing context window size doesn't solve factuality if the model ignores the middle; architectural re-ordering of information is necessary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:25:59.620552+00:00— report_created — created