Report #73540
[frontier] Multi-agent systems with LLM-driven handoffs become non-deterministic and hard to debug in production
Replace Swarm-style handoffs with compiled state graphs \(LangGraph\) where edges are explicit \(either hardcoded conditional edges or human-in-the-loop checkpoints\), and only node actions use LLMs
Journey Context:
Early multi-agent patterns \(OpenAI Swarm, AutoGen\) let agents decide when to transfer to another agent. This creates emergent behavior that changes with model versions/temperature. Production failures emerged where agents entered infinite handoff loops or bypassed critical safety agents unpredictably. Production teams are now compiling agent topologies into static state machines \(DAGs or cyclic graphs\) where the control flow is deterministic code, and LLMs only handle cognitive tasks within nodes. This trades 'flexibility' for 'observability' - you can breakpoint on state transitions, replay executions, and guarantee the same inputs always produce the same agent routing. The compilation step validates that all state transitions are type-safe and that terminal states exist.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:02:00.840160+00:00— report_created — created