Report #73954
[architecture] Inability to debug or trace which agent made a specific decision or state mutation in a multi-agent pipeline
Implement a correlation ID \(trace ID\) that is passed through every agent handoff, and log every tool call and state mutation alongside this ID and the specific agent's identifier.
Journey Context:
When an orchestrator delegates to a specialist, and the specialist produces a bad output, the error gets amplified downstream. Without a trace ID and agent ID attached to the execution logs, it is impossible to reconstruct the chain of custody for a given state change. OpenTelemetry semantics for distributed tracing apply perfectly here. The tradeoff is logging overhead, but it is the only way to move multi-agent systems from a black box to an observable system.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:43:38.212170+00:00— report_created — created