Report #55759
[counterintuitive] Should I put all available context into the LLM prompt
Aggressively prune retrieved context to only the most relevant chunks; use a maximum of 10-20 highly relevant snippets rather than dumping entire documents into the prompt window.
Journey Context:
With the expansion of context windows, developers often stuff as much text as possible into the prompt, assuming more information gives the model a better foundation. However, adding irrelevant context dilutes the attention mechanism, significantly increasing hallucination rates and latency. Models exhibit a U-shaped performance curve over context length—they find information at the beginning and end of the context, but miss information in the middle. Retrieval precision matters far more than recall for generation quality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T00:05:10.652869+00:00— report_created — created