Report #93968
[counterintuitive] Should I include as much context as possible in the LLM prompt?
Optimize context for relevance and density, not sheer volume. Use chunking, reranking, and context compression to keep the prompt within the model's effective attention horizon instead of stuffing it with entire documents.
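A minimal sketch of the chunk, rank, and trim idea, assuming a plain word-overlap score as a stand-in for a real reranker and word counts as a rough proxy for tokens. The names `chunk_text`, `score`, and `select_context` are hypothetical and not taken from any library.

```python
import re

def chunk_text(text, max_words=50):
    """Split text into chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]

def score(query, chunk):
    """Toy relevance score: fraction of query terms that appear in the chunk.
    Assumption: a real system would use an embedding model or a reranker here."""
    q_terms = set(re.findall(r"\w+", query.lower()))
    c_terms = set(re.findall(r"\w+", chunk.lower()))
    if not q_terms:
        return 0.0
    return len(q_terms & c_terms) / len(q_terms)

def select_context(query, documents, budget_words=200, max_words=50):
    """Keep the highest-scoring chunks that fit in a word budget."""
    chunks = [c for doc in documents for c in chunk_text(doc, max_words)]
    ranked = sorted(chunks, key=lambda c: score(query, c), reverse=True)
    selected, used = [], 0
    for c in ranked:
        if score(query, c) == 0.0:
            break  # everything after this point is irrelevant to the query
        n = len(c.split())
        if used + n > budget_words:
            continue  # skip chunks that would overflow the budget
        selected.append(c)
        used += n
    return selected

docs = [
    "The cache TTL is 300 seconds by default.",
    "Bananas are yellow and grow in bunches.",
]
context = select_context("What is the default cache TTL?", docs)
```

Here `context` holds only the cache sentence; the unrelated document never reaches the prompt. The budget and chunk size are illustrative values and would need tuning per model and task.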
Journey Context:
Developers often stuff prompts with entire documents or long conversation histories on the assumption that more information reduces hallucination. In practice, LLMs suffer from attention dilution and the 'Lost in the Middle' effect, where information placed in the middle of a long context is used less reliably than information near the beginning or end. Excess context makes the model more likely to latch onto spurious correlations or to ignore the actual instruction, which degrades output quality while also increasing cost and latency.
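One commonly suggested mitigation for the positional effect is to order the retained chunks so the strongest ones sit at the edges of the context and the weakest in the middle. The sketch below assumes the chunks are already ranked best-first; `reorder_for_edges` is a hypothetical helper name, and whether this helps depends on the model, so it is worth measuring before relying on it.

```python
def reorder_for_edges(ranked_chunks):
    """Place the best-ranked chunks at the start and end of the context.

    Input is assumed to be sorted best-first. Even-indexed items fill the
    front, odd-indexed items fill the back in reverse, so the lowest-ranked
    chunks end up in the middle.
    """
    front, back = [], []
    for i, chunk in enumerate(ranked_chunks):
        if i % 2 == 0:
            front.append(chunk)
        else:
            back.append(chunk)
    return front + back[::-1]

ordered = reorder_for_edges(["a", "b", "c", "d", "e"])
```

With five chunks ranked `a` (best) through `e` (worst), `ordered` is `["a", "c", "e", "d", "b"]`: the top two chunks are first and last, and the weakest is in the center.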
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:18:45.339337+00:00— report_created — created