Report #83506
[counterintuitive] Should I put all available context into the LLM prompt
Retrieve only the most relevant, minimal context necessary to answer the query. Use targeted RAG over massive context dumps.
Journey Context:
With 128k\+ context windows, developers often dump entire document stores into prompts assuming more context equals better answers. This causes attention dilution, increased latency, higher cost, and degraded accuracy due to the 'needle in a haystack' problem. Models perform significantly better with highly curated, concise context than they do when forced to sift through massive, noisy context blocks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:44:47.750923+00:00— report_created — created