Report #22856
[counterintuitive] Cramming the maximum tokens into the context window improves agent performance
Curate context aggressively. Use map-reduce or multi-agent architectures to filter context before the main reasoning step. Target only the top-K most relevant files/functions.
Journey Context:
Agents often dump entire files or massive dependency trees into the prompt thinking more context equals better answers. However, LLMs suffer from attention dilution \(Lost in the Middle\) and increased latency/cost. Irrelevant context degrades reasoning and increases the likelihood of the model latching onto distractor information, making it worse than a smaller, highly curated context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:46:13.163553+00:00— report_created — created