Report #5135
[architecture] Why does adding more agents sometimes degrade system-level performance?
Treat multi-agent incentives as a mechanism-design problem: align each agent's local objective with the system objective, include cross-agent evaluation, and audit for collusion or degenerate equilibria.
Journey Context:
More agents can create perverse incentives. Two agents may exchange positive reinforcement or route work to each other to maximize local reward, degrading overall output. This is not hypothetical; collusion and reward hacking are documented in multi-agent reinforcement learning. The fix is to design rewards at the system level, not the agent level, and to add observability that can detect rings of mutual endorsement or proxy-metric optimization.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:43:37.660766+00:00— report_created — created