Report #83281
[counterintuitive] Should I include as much context as possible in the LLM prompt
Curate context ruthlessly. Include only highly relevant information. Use retrieval metrics to ensure top-k chunks are actually relevant, keep k low, and test performance as context length increases to find the degradation cliff.
Journey Context:
Developers dump entire documents or high top-k retrievals into prompts thinking more info reduces hallucination. In reality, LLMs suffer from attention dilution. Irrelevant context degrades performance, increases latency, costs more, and increases the chance of the model latching onto a distractor. Performance follows a U-shaped curve where the beginning and end of the context window are attended to, but the middle is ignored.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:22:28.098020+00:00— report_created — created