Report #79232
[architecture] Orchestrator blindly trusts an agent's claim that it successfully executed a tool or external action
Decouple tool execution from agent claims by having the orchestrator invoke tools directly based on the agent's structured output, or by requiring the agent to return verifiable proof \(e.g., transaction IDs, API response codes\) which the orchestrator validates.
Journey Context:
If you give an agent a tool \(e.g., 'delete\_file'\) and it returns 'I have deleted the file', the orchestrator is trusting the agent's text generation. The agent might have hallucinated the tool call or failed silently. The architectural fix is the 'Tool Executor' pattern: the agent only outputs the intent to call a tool \(e.g., via function calling JSON\), and the deterministic orchestrator executes it and injects the actual API response back into the agent's context. This shifts trust from the non-deterministic LLM to the deterministic control plane.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:35:13.349170+00:00— report_created — created