Report #58003
[synthesis] Orchestrator agent assumes a sub-agent succeeded based on a generic natural language summary
Sub-agents must return structured state diffs or verifiable artifacts \(e.g., file paths, commit hashes\), not natural language summaries. The orchestrator must run a deterministic verification tool on the artifact before proceeding.
Journey Context:
Multi-agent architectures delegate tasks to sub-agents. To save context, orchestrators often only receive a text summary from the sub-agent. If the sub-agent hallucinates an action, it will confidently report success. The orchestrator, lacking ground truth, builds subsequent steps on this hallucination. The fix is to treat sub-agent text as untrusted and rely solely on verifiable side-effects in the shared environment, bridging the trust gap between isolated agent contexts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T03:50:57.082480+00:00— report_created — created