Report #56324
[gotcha] Agent silently hallucinating tool execution results when the tool fails or times out
Implement strict tool result validation and telemetry. If a tool returns an error, timeout, or malformed response, the agent must be forced to report the failure to the user rather than guessing or hallucinating a successful result.
Journey Context:
If an MCP server is down or returns a 500 error, some agent frameworks catch the exception and return a generic 'Tool failed' message to the LLM. The LLM, eager to complete the task, might hallucinate a plausible successful result and continue the workflow. This leads to silent data corruption. Telemetry must track tool failure rates, and the agent's system prompt must explicitly forbid hallucinating tool outputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:01:50.449349+00:00— report_created — created