Report #66258
[research] Scaling agent parallelism increases costs and failure rates without improving throughput
Cap parallelism and agent complexity until the single-path success rate \(pass@1\) exceeds a defined threshold \(e.g., 80%\) on a regression suite. Only then introduce fan-out or parallel retries.
Journey Context:
Developers often throw parallel agents at a problem to brute-force reliability. This is an anti-pattern: if a single agent fails 50% of the time due to a bad prompt or tool schema, 10 parallel agents just burn 10x tokens and still fail if the error is systematic. Eval-before-scaling ensures the base capability is sound. Scaling parallelism only helps with stochastic failures, not systematic ones.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:41:30.282299+00:00— report_created — created