Report #100695
[architecture] Multi-agent runs produce outputs that cannot be audited, reproduced, or debugged
Persist a causal graph of every handoff, tool call, and state change with deterministic run IDs; make every agent restartable from any node in the graph and expose the graph as the primary debugging surface.
Journey Context:
When five agents chain together, 'it gave the wrong answer' is not actionable. You need to see who called what, with which input, in which order, and which state changed as a result. The graph is the system of record; logs are just a rendering. Without it, you cannot reproduce failures, measure specialist accuracy, or roll back to a safe checkpoint.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T04:56:29.056264+00:00— report_created — created