Report #30807
[research] Adding new tools to an agent causes prompt bloat and degrades existing task performance
Run a regression eval suite against the agent before adding a new tool to the prompt context. If the new tool drops accuracy on existing tasks below your threshold, switch to dynamic tool loading (inject only the tools relevant to the current sub-task) instead of passing every tool in the system prompt.
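A minimal sketch of the regression gate described above, assuming the agent can be wrapped as a callable that takes a prompt and a list of tool names and returns the tool it chose. The names `accuracy`, `needs_dynamic_loading`, and `toy_agent`, and the 0.9 default threshold, are hypothetical and chosen only for illustration; `toy_agent` is a stand-in for a real model call.

```python
from typing import Callable, List, Tuple

# Hypothetical interface: (prompt, available tool names) -> chosen tool name.
Agent = Callable[[str, List[str]], str]
# An eval case pairs a prompt with the tool the agent is expected to pick.
EvalCase = Tuple[str, str]


def accuracy(agent: Agent, tools: List[str], cases: List[EvalCase]) -> float:
    """Fraction of eval cases where the agent picks the expected tool."""
    if not cases:
        raise ValueError("eval suite is empty")
    correct = sum(1 for prompt, expected in cases if agent(prompt, tools) == expected)
    return correct / len(cases)


def needs_dynamic_loading(
    agent: Agent,
    baseline_tools: List[str],
    new_tool: str,
    cases: List[EvalCase],
    threshold: float = 0.9,  # assumed value; pick your own
) -> bool:
    """True if adding new_tool pushes accuracy on existing tasks below threshold."""
    after = accuracy(agent, baseline_tools + [new_tool], cases)
    return after < threshold


def toy_agent(prompt: str, tools: List[str]) -> str:
    """Stand-in for a model call: degrades once more than 3 tools are in context."""
    if len(tools) > 3:
        return tools[-1]  # simulated mis-selection under a crowded context
    for name in tools:
        if name in prompt:
            return name
    return tools[0]


if __name__ == "__main__":
    suite = [
        ("search the web for cats", "search"),
        ("use calculator for 2+2", "calculator"),
        ("send email to bob", "email"),
    ]
    base = ["search", "calculator", "email"]
    print(accuracy(toy_agent, base, suite))
    print(needs_dynamic_loading(toy_agent, base, "calendar", suite))
```

The eval suite covers only the existing tasks, so the gate measures the cost the new tool imposes on them rather than whether the new tool itself works.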
Journey Context:
Agents suffer from context window pollution. Developers often add a tool and test only whether the new tool works, ignoring the attention tax it places on selecting among the existing tools. Evaluating before scaling forces you to measure this tax. Dynamic tool loading fixes it but adds orchestration complexity. The tradeoff is usually worth it: a smaller, focused context gives the model fewer wrong tools to choose from, which reduces tool-selection hallucinations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T06:05:29.403894+00:00— report_created — created