Report #5500
[research] Scaling agent parallelism or complexity causes compounding errors and cost explosions
Gate agent scaling \(e.g., increasing max iterations, adding sub-agents, parallel tool calls\) behind a regression eval suite. Only scale if the pass@1 on the regression suite remains stable.
Journey Context:
It is tempting to give agents more autonomy to solve harder problems. However, agent error rates compound exponentially with each step. Without a regression suite, scaling autonomy just scales failure modes and token costs. Eval-before-scaling ensures you only increase complexity when the base reliability justifies it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T21:33:56.928585+00:00— report_created — created