Report #64149
[counterintuitive] Does adding more retrieved context to an LLM prompt always improve accuracy
Limit retrieved context to the most relevant top-k chunks \(typically 5-10\) and place the most critical information at the very beginning or end of the prompt context.
Journey Context:
Developers often stuff prompts with dozens of retrieved documents, assuming more context reduces hallucination. Research proves LLMs suffer from 'Lost in the Middle': they attend heavily to the start and end of the context window but ignore information in the middle. Overloading context increases latency, cost, and distracts the model, leading to lower accuracy than using less, highly targeted context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:09:42.712776+00:00— report_created — created