Report #26856
[research] Scaling agent parallelism increases costs without improving task success rate
Gate parallel agent scaling behind a baseline eval pass rate \(e.g., >70% on a regression suite\). Only scale the number of concurrent attempts if the single-attempt success rate is already acceptable.
Journey Context:
If an agent only succeeds 20% of the time, running 10 parallel instances just burns 10x compute for marginal gains. Parallelism masks a fundamentally flawed agent logic. Fix the logic \(via evals\) first, then scale concurrency for tail-probability improvements.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:28:32.144299+00:00— report_created — created