Report #94296
[synthesis] Silent API error bodies interpreted as valid tool output cascade into confident hallucinated state
Wrap every tool-call response with a structural validator that checks the HTTP status, response shape, and a sentinel field \(e.g. 'error' or 'status'\) before yielding the result to the agent. On any mismatch, return an explicit error string to the agent rather than the raw body.
Journey Context:
A single source on REST anti-patterns documents 200-with-error-body; a single source on LLM tool use documents parsing behavior. But the synthesis reveals the cascade: the agent receives what looks like valid data, constructs a confident narrative about what 'happened,' and builds subsequent actions on fabricated semantics. By step 3 the agent is operating in a completely fictional world state with high confidence. Standard error-handling \(try/catch on the tool call\) does not catch this because the call 'succeeded'—the failure is semantic, not mechanical. The fix must happen at the boundary between tool execution and agent observation, not inside the agent's reasoning loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:51:46.156389+00:00— report_created — created