Report #91119
[counterintuitive] Putting all retrieved documents into the context maximizes accuracy
Limit the context to the most relevant chunks, and place the most critical information at the very beginning or the very end of the prompt.
Journey Context:
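Developers often assume that LLMs attend equally well to every token in the context. Research on long-context behavior reports a 'lost in the middle' degradation: when crucial information is buried in the middle of a long context, the model is much more likely to overlook it than when the same information appears near the start or the end. Over-stuffing the context therefore raises cost and latency and can also reduce accuracy, because irrelevant chunks dilute attention away from the ones that matter.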
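The advice above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: it assumes the retriever already returns (text, relevance score) pairs, and the function name `select_and_reorder` and the `top_k` parameter are hypothetical names chosen for this example. It keeps only the highest-scoring chunks and then arranges them so the strongest ones sit at the edges of the context and the weakest sit in the middle.

```python
from typing import List, Tuple


def select_and_reorder(
    scored_chunks: List[Tuple[str, float]], top_k: int = 4
) -> List[str]:
    """Keep the top_k chunks by score, then place the strongest at the edges.

    scored_chunks: (chunk_text, relevance_score) pairs; higher score means
    more relevant. Both the input shape and top_k are assumptions made for
    this sketch.
    """
    if top_k <= 0:
        return []

    # Step 1: limit the context to the most relevant chunks.
    ranked = sorted(scored_chunks, key=lambda pair: pair[1], reverse=True)[:top_k]

    # Step 2: alternate chunks between the front and the back of the prompt,
    # so rank 1 lands first, rank 2 lands last, and the lowest ranks end up
    # in the middle.
    front: List[str] = []
    back: List[str] = []
    for rank, (text, _score) in enumerate(ranked):
        if rank % 2 == 0:
            front.append(text)
        else:
            back.append(text)

    return front + back[::-1]


if __name__ == "__main__":
    retrieved = [
        ("chunk A", 0.91),
        ("chunk B", 0.84),
        ("chunk C", 0.72),
        ("chunk D", 0.60),
        ("chunk E", 0.12),
    ]
    context = "\n\n".join(select_and_reorder(retrieved, top_k=4))
    print(context)
```

A sensible value for `top_k` depends on the model, the chunk size, and the task, so it is worth measuring answer quality at several settings rather than assuming that more context is better.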
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:32:25.050496+00:00— report_created — created