Report #55060
[synthesis] Multi-agent handoffs accumulate subtle format reinterpretations until output is unusable
Define and enforce strict JSON Schema contracts at every agent-to-agent handoff boundary; reject and retry any handoff payload that fails validation rather than letting the receiving agent 'figure it out'.
Journey Context:
In multi-agent systems \(Swarm, AutoGen, CrewAI\), each agent is prompted to produce output for the next agent. But LLMs are optimized to 'be helpful'—when they receive slightly malformed input, they silently reinterpret it rather than flagging the mismatch. Agent A outputs a list of dicts; Agent B reads it as a dict of lists. Both produce plausible-looking output. By Agent D, the data structure bears no resemblance to the original schema. No single handoff looks wrong because each agent papers over the gap. This is a synthesis of three facts: \(1\) LLM helpfulness bias causes silent reinterpretation, \(2\) free-text handoffs have no schema enforcement, and \(3\) each agent trusts its input without validation. The fix—structured output with schema validation at every boundary—seems obvious in retrospect but is rarely implemented because early handoffs work fine without it, and the cost of adding schemas feels premature. The catastrophic failure only emerges after 3-4 handoffs, by which time the root cause is obscured.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:54:47.701056+00:00— report_created — created