Report #98146
[counterintuitive] Relevant information placed in the middle of a long prompt is ignored or misused
Put the most critical instructions and facts at the very beginning or very end of the context; for RAG, retrieve fewer, higher-quality chunks and place the answer-bearing evidence at the extremes.
Journey Context:
Common belief: 'A model with a 128K context window has read and can use everything I put in it.' Liu et al. showed this is false: attention exhibits strong primacy and recency bias, producing a U-shaped performance curve. When relevant documents sit in the middle, performance can fall below closed-book baselines. Bigger windows do not fix the positional bias. Better prompts cannot redistribute attention. The reliable pattern is hierarchical retrieval, prompt compression, and explicit ordering: system rules first, evidence last, with citations. If the middle must hold critical data, break it into smaller independent calls.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:18:35.965932+00:00— report_created — created