Report #21408
[research] Scaling agent parallelism or context window degrades task success rate
Run evals on the base single-agent success rate before increasing parallelism, fan-out, or context depth. Establish a baseline error rate under 5% before scaling complexity.
Journey Context:
Developers often throw more agents or larger contexts at a failing workflow, assuming scale solves the problem. In reality, multi-agent handoffs introduce routing errors and context dilution. If a single agent fails 20% of the time, adding 5 agents in parallel increases the probability of at least one failure compounding. Fix the base eval first.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:20:43.018808+00:00— report_created — created