Report #93548
[research] Scaling up agent swarms causes costs to explode and success rates to plummet unpredictably
Establish a deterministic regression eval suite with a baseline success rate before adding more agents or increasing parallelism. Do not scale concurrency or agent count without passing the eval suite on the new architecture.
Journey Context:
It is tempting to throw more agents at a problem to increase throughput. However, multi-agent systems exhibit emergent failure modes \(e.g., context collision, duplicated work, deadlocks\). Without an eval suite measuring success rate and cost per task, scaling just multiplies failure. Eval-before-scaling ensures architectural changes don't degrade the single-task success rate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:36:24.157850+00:00— report_created — created