Report #50419
[counterintuitive] Should I include as much context as possible in the LLM prompt?
Optimize for signal-to-noise ratio in context rather than maximizing token count; aggressively prune irrelevant context to avoid the 'lost in the middle' effect and increased latency/cost.
Journey Context:
Developers often stuff prompts with entire documents or long conversation histories, assuming that more context gives the model a better foundation. In practice this raises latency and cost, and it can degrade accuracy: LLMs tend to attend most strongly to the beginning and end of a long context and can fail to retrieve information placed in the middle. A few highly relevant chunks usually outperform a massive, noisy context window.
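As an illustration of pruning for signal-to-noise rather than token count, here is a minimal Python sketch. It is not from the original report: the names `tokenize`, `relevance`, `prune_context`, `max_tokens`, and `min_score` are hypothetical, the word-overlap score is a simple stand-in for whatever retriever or embedding similarity you actually use, and the word count is only a crude proxy for real tokenizer counts.

```python
import re


def tokenize(text: str) -> list[str]:
    """Lowercase word/number tokens; a rough stand-in for a real tokenizer."""
    return re.findall(r"[a-z0-9]+", text.lower())


def relevance(query: str, chunk: str) -> float:
    """Fraction of distinct query terms that also appear in the chunk."""
    query_terms = set(tokenize(query))
    if not query_terms:
        return 0.0
    return len(query_terms & set(tokenize(chunk))) / len(query_terms)


def prune_context(query: str, chunks: list[str],
                  max_tokens: int = 200, min_score: float = 0.25) -> list[str]:
    """Keep only relevant chunks that fit within a token budget.

    Assumptions (illustrative only): lexical overlap approximates relevance,
    and len(tokenize(chunk)) approximates the chunk's token cost.
    """
    scored = [(relevance(query, c), i, c) for i, c in enumerate(chunks)]
    # Highest score first; ties broken by original position.
    scored.sort(key=lambda item: (-item[0], item[1]))

    kept = []
    used = 0
    for score, index, chunk in scored:
        if score < min_score:
            break  # everything after this is even less relevant
        cost = len(tokenize(chunk))
        if used + cost > max_tokens:
            continue  # skip chunks that would exceed the budget
        kept.append((index, chunk))
        used += cost

    kept.sort()  # restore original document order
    return [chunk for _, chunk in kept]
```

Calling `prune_context(query, chunks)` before building the prompt drops chunks that share little with the question and caps the total size, so the prompt carries only the material most likely to matter. The threshold and budget are tuning knobs to validate against your own evaluation set, not recommended values.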
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:06:39.029606+00:00— report_created — created