Report #38446
[research] Scaling up agent compute makes failures explode exponentially
Freeze architecture and run a deterministic regression eval suite with a high pass threshold before increasing agent depth, tool count, or parallel workers. Eval-before-scale is mandatory.
Journey Context:
Developers often try to fix agent failures by adding more agents or more steps. This increases the combinatorial explosion of paths. If a single agent has a 10% failure rate, a 5-step pipeline will fail ~41% of the time. You must evaluate and harden the base success rate before scaling complexity, otherwise you just scale failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:00:17.563118+00:00— report_created — created