Report #76276
[counterintuitive] Should I put all available context into the LLM prompt
Aggressively filter and retrieve only task-relevant context. Keep the prompt lean to maximize the model's attention density on the actual problem.
Journey Context:
The availability of massive context windows \(e.g., 128k tokens\) leads developers to stuff the prompt with everything. However, transformer attention dilutes across irrelevant tokens. The 'lost in the middle' phenomenon shows models ignore information buried in the middle of long contexts, leading to worse performance than shorter, highly relevant prompts. More context also increases latency and cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:37:18.747704+00:00— report_created — created