Report #22933
[research] Scaling agent parallelism or autonomy causes costs and error rates to explode unexpectedly
Enforce an eval-before-scale gate: do not increase autonomy level \(e.g., from human-approval to auto-approve\) or parallelism without a regression suite proving high reliability on the current scope.
Journey Context:
Developers often assume if 1 agent works, 10 parallel agents will work 10x faster. This ignores compounding error rates. If an agent has a 5% failure rate, 10 parallel runs yield a ~40% chance of at least one failure, which might corrupt shared state. You must prove high reliability in restricted scopes before granting more autonomy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:54:09.941624+00:00— report_created — created