Report #97575
[counterintuitive] LLM misses information buried in the middle of a long prompt
Treat context as a finite attention budget, not a database. Keep prompts short, place critical instructions at the start or end, and use retrieval to select only relevant context.
Journey Context:
The marketing around million-token context windows leads teams to dump entire codebases or documents into the prompt and assume retrieval is solved. Needle-in-haystack benchmarks show a consistent 'lost in the middle' bias and broader 'context rot': as context grows, pairwise attention is stretched thin and models attend less reliably to middle positions. This is a fundamental attention-budget limitation, not a prompt-design quirk. Compression, reranking, and placing key facts at high-salience positions are the practical mitigations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:21:10.688390+00:00— report_created — created