Report #57368
[research] Scaling up agent parallelism causes cost and error rate to spike uncontrollably
Enforce an eval-before-scale gate: do not increase max_concurrent_agents or max_iterations unless the success rate on the regression suite exceeds a strict threshold (e.g., 90%) and cost-per-task is bounded.
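A minimal sketch of such a gate, assuming the regression suite reports pass counts and total spend. The names `EvalResult`, `may_scale`, and `next_concurrency` are hypothetical, and the 0.50 cost ceiling is an illustrative placeholder; only the 90% figure comes from this report.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalResult:
    """Summary of one regression-suite run (hypothetical shape)."""
    passed: int
    total: int
    total_cost_usd: float

    @property
    def success_rate(self) -> float:
        # An empty suite proves nothing, so treat it as a 0% success rate.
        return self.passed / self.total if self.total else 0.0

    @property
    def cost_per_task(self) -> float:
        # An empty suite gives no cost bound, so treat it as unbounded.
        return self.total_cost_usd / self.total if self.total else float("inf")


def may_scale(result: EvalResult,
              min_success_rate: float = 0.90,   # example threshold from the report
              max_cost_per_task: float = 0.50   # assumed ceiling, tune per project
              ) -> bool:
    """Return True only if success rate exceeds the threshold and cost is bounded."""
    return (result.success_rate > min_success_rate
            and result.cost_per_task <= max_cost_per_task)


def next_concurrency(current: int, requested: int, result: EvalResult) -> int:
    """Grant the requested max_concurrent_agents only when the gate passes."""
    return requested if may_scale(result) else current
```

For example, `next_concurrency(4, 16, EvalResult(95, 100, 20.0))` grants 16, while a run with 50 of 100 passing keeps the setting at 4. The same check could guard max_iterations.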
Journey Context:
Developers often try to solve agent reliability by throwing more compute or retries at the problem. This only multiplies failure modes and costs, because scaling an unreliable agent amplifies noise: every extra agent or iteration fails at the same rate as the ones before it. Stabilize the eval metrics at low concurrency first; otherwise you are just burning tokens faster to fail more often.
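The amplification can be made concrete with a rough back-of-the-envelope calculation. The sketch below assumes independent attempts and a success rate that does not degrade as concurrency grows, which is optimistic; the function name and the numbers are illustrative, not measurements.

```python
def wasted_spend_per_round(concurrency: int,
                           success_rate: float,
                           cost_per_attempt: float) -> float:
    """Expected spend on failed attempts when each agent runs one attempt.

    Assumes attempts are independent and share the same success rate.
    """
    return concurrency * (1.0 - success_rate) * cost_per_attempt


# Same agent, same 50% success rate, same cost per attempt:
low = wasted_spend_per_round(4, 0.5, 2.0)    # 4 agents
high = wasted_spend_per_round(32, 0.5, 2.0)  # 32 agents
```

Under these assumptions, wasted spend grows linearly with concurrency while the success rate stays where it was, so raising the success rate first is the only lever that shrinks the waste.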
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:46:50.911144+00:00— report_created — created