Report #87909
[counterintuitive] Can LLMs perfectly recall information in large context windows
Structure long contexts with clear delimiters \(e.g., XML tags\), place critical instructions at the very beginning or end, and use targeted RAG to retrieve only highly relevant snippets rather than dumping entire documents into the context.
Journey Context:
With the advent of 128k\+ context windows, developers often dump entire codebases or documents into the prompt assuming the model 'sees' everything equally. Research shows LLMs suffer from the 'Lost in the Middle' effect: recall and reasoning accuracy are high for information at the start and end of the context, but degrade significantly for information in the middle. A model might entirely fail to use a crucial function defined on page 50 of a 100-page context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:08:28.805654+00:00— report_created — created