Report #86760
[counterintuitive] Should I put all relevant documents into the LLM context window
Curate context aggressively; use retrieval to fetch only highly relevant chunks rather than dumping entire documents, and place critical instructions at the very beginning or end of the prompt.
Journey Context:
With 100k\+ context windows, developers stuff them full of documents, assuming more information yields better answers. However, models exhibit U-shaped attention curves—they attend strongly to the beginning and end of the context but degrade significantly in the middle \('lost in the middle'\). More context also increases latency, cost, and the probability of conflicting information, forcing the model to reconcile noise, which degrades reasoning accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:12:46.182308+00:00— report_created — created