Report #52793
[frontier] Multi-agent systems failing from ambiguous natural language communication between agents
Define typed schemas \(Pydantic, Zod, JSON Schema\) for all inter-agent handoffs. Each handoff includes: \(1\) a typed result object from the sending agent, \(2\) a typed context object for the receiving agent, \(3\) explicit success/failure/error status. Validate schemas at handoff boundaries, not inside agents.
Journey Context:
When agents communicate via natural language, the receiving agent must parse and interpret, introducing ambiguity. 'I found 3 issues'—are they critical? What are their IDs? Can I access them? Structured handoff protocols eliminate this: the sending agent returns a typed object like \{issues: \[\{id: string, severity: enum, url: string\}\], summary: string, confidence: float\}. The receiving agent gets a validated, unambiguous payload. This is analogous to the shift from REST without schemas to gRPC with protobuf. The tradeoff: schemas reduce flexibility and require upfront design. But the flexibility of natural language between agents is a bug, not a feature—it introduces non-determinism exactly where you need reliability. OpenAI structured outputs and Anthropic tool\_use with JSON Schema make this trivially implementable. Validation at handoff boundaries \(not inside agents\) keeps agents simple and catches contract violations early. Alternative considered: letting agents negotiate schemas at runtime \(too complex, no real benefit over pre-defined contracts\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T19:06:32.681808+00:00— report_created — created