Report #85910
[counterintuitive] Should I include as much context and as many retrieved documents as possible in the prompt?
Limit retrieved context to the top-K most relevant chunks (usually 3-5) and use an intermediate reasoning step to filter out marginal chunks before answering. 'Lost in the middle' effects and attention dilution degrade performance when the context window is saturated with marginal information.
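A minimal sketch of the top-K limit, assuming the retriever already returns a relevance score per chunk. The names `select_top_k` and `build_prompt`, the `min_score` threshold, and the prompt layout are hypothetical and not taken from any particular library:

```python
from typing import List, Tuple


def select_top_k(
    scored_chunks: List[Tuple[str, float]],
    k: int = 4,
    min_score: float = 0.5,
) -> List[str]:
    """Keep at most k chunks, dropping anything below min_score.

    scored_chunks holds (text, relevance_score) pairs; higher is better.
    The 0.5 default is an illustrative assumption, not a recommended value.
    """
    # Filter marginal chunks first so a large k cannot pull them back in.
    relevant = [(text, score) for text, score in scored_chunks if score >= min_score]
    # Sort best-first; Python's sort is stable, so ties keep retriever order.
    relevant.sort(key=lambda pair: pair[1], reverse=True)
    return [text for text, _ in relevant[:k]]


def build_prompt(question: str, chunks: List[str]) -> str:
    """Assemble a prompt from the selected chunks only."""
    context = "\n\n".join(f"[{i}] {chunk}" for i, chunk in enumerate(chunks, start=1))
    return f"Context:\n{context}\n\nQuestion: {question}"
```

The score threshold matters as much as K: if only two chunks are actually relevant, the prompt should contain two chunks, not four.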
Journey Context:
Developers often fill the context window on the assumption that more information yields better answers. Research on long-context retrieval reports a U-shaped performance curve: models use information near the beginning and end of the context most reliably and often miss information placed in the middle. Over-stuffing the prompt also increases latency and cost, and raises the chance that the model falls back on its parametric memory instead of the provided context.
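One way to act on the U-shaped curve, offered as an assumption rather than part of the original report, is to order the selected chunks so the strongest ones sit at the edges of the context and the weakest in the middle. The helper name `order_for_edges` is hypothetical, and the input is assumed to be sorted best-first:

```python
from typing import List


def order_for_edges(ranked_chunks: List[str]) -> List[str]:
    """Reorder best-first chunks so the top-ranked ones land at the edges.

    Even-indexed chunks (rank 1, 3, 5, ...) fill the front in order;
    odd-indexed chunks (rank 2, 4, ...) fill the back in reverse, so the
    lowest-ranked chunks end up in the middle of the context.
    """
    front: List[str] = []
    back: List[str] = []
    for index, chunk in enumerate(ranked_chunks):
        if index % 2 == 0:
            front.append(chunk)
        else:
            back.append(chunk)
    return front + back[::-1]
```

With five chunks ranked c1 (best) through c5 (weakest), this yields c1, c3, c5, c4, c2: the best chunk opens the context, the second best closes it, and the weakest sits in the middle. Whether this helps for a given model should be checked empirically.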
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:47:11.084969+00:00— report_created — created