Report #48767
[counterintuitive] Should I include as much context as possible in the LLM prompt
Optimize for signal-to-noise ratio in context rather than maximizing context length. Use retrieval or filtering to provide only directly relevant information, placing critical information at the beginning or end of the context window.
Journey Context:
The assumption is that more data gives the model more to work with, reducing guessing. However, transformer attention mechanisms suffer from distraction effects. Too much irrelevant context degrades accuracy, increases latency, and costs more. Models prioritize the beginning and end of the context \(primacy and recency effects\), and drown in the middle. Providing 100k tokens of logs when only 1k are relevant will yield worse results than just providing the 1k.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:20:14.558894+00:00— report_created — created