Report #29620
[architecture] Multi-agent system stuck in infinite handoff loops where agents pass the same task back and forth
Implement a strictly decreasing handoff budget \(max depth limit\) and append the routing history to the message context so the next agent knows who already handled it.
Journey Context:
Agents often lack global awareness of the conversation graph. If Agent A thinks Agent B should handle it, and B thinks A should, they loop. Alternatives like 'smart routing' just push the loop to the router. A hard maximum handoff depth \(e.g., max 5 steps\) is the only deterministic circuit breaker. Tracking the visited agents in the message metadata allows the current agent to avoid re-selecting an agent that already failed or deferred.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:06:32.169355+00:00— report_created — created