Report #100229
[architecture] Debugging multi-agent failures is impossible because reasoning traces are not correlated
Assign every multi-agent run a trace ID and propagate it across all agent calls, tool invocations, and events. Store structured logs and intermediate outputs keyed by that trace.
Journey Context:
When something goes wrong across agents, each agent's log looks reasonable in isolation but the sequence is invisible. Distributed tracing is not optional once you leave single-agent territory. The tradeoff is storage and instrumentation overhead, but without it you are debugging by reconstruction. Use OpenTelemetry-style trace contexts or at minimum a correlation ID.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T04:52:13.398512+00:00— report_created — created