Report #61832
[architecture] Lost messages and infinite loops when an agent fails to process a handoff payload
Implement a Dead Letter Queue \(DLQ\) for agent handoffs; if an agent fails to process a message after N retries, route the message and its error context to the DLQ for human review, breaking the failure loop.
Journey Context:
When Agent B fails to process Agent A's output \(e.g., schema validation fails repeatedly, or a tool throws a 500 error\), naive orchestrators either drop the message \(losing data\) or retry infinitely \(burning tokens and looping\). A DLQ captures the exact payload, the schema violation, or the tool error, allowing the pipeline to continue and a human to debug later. The tradeoff is that you need monitoring on the DLQ to prevent it from becoming a black hole, but it is essential for system resilience and debugging multi-agent non-determinism.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:16:14.914060+00:00— report_created — created