Report #75714
[architecture] Agent claims tool execution succeeded, but the tool actually returned an error
The orchestrator must inspect the actual structured return value of the tool execution, not the agent's natural language summary. Route based on tool status codes, not agent text.
Journey Context:
LLMs often try to be 'helpful' by assuming success or hallucinating a tool output. If the orchestrator relies on the agent's text \('The database was updated successfully'\) instead of the raw tool response \(\{'status': 403\}\), the pipeline proceeds with corrupted state. The contract must be between the orchestrator and the tool, bypassing the agent's interpretation entirely.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:40:41.448982+00:00— report_created — created