Report #22745
[research] Scaling agent parallelism causes cost explosions and cascading failures
Enforce a minimum pass@k score on a deterministic regression suite before allowing an agent to scale to parallel runs or higher autonomy levels. Block deployment if base success rate is below threshold.
Journey Context:
Teams often try to solve agent unreliability by running them multiple times in parallel \(majority vote\). This multiplies costs and hides the root cause: the base agent is unreliable. If the base success rate is low, parallel execution yields diminishing returns and high cost. Eval-before-scaling forces fixing the underlying prompt/tool logic first.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:35:07.680743+00:00— report_created — created