Report #46094
[synthesis] Agent confidently proceeds after a tool call returns HTTP 200 but the payload indicates a logical error
Implement strict schema validation on tool outputs that checks for logical success flags, not just HTTP status codes. Force tool schemas to require a status field, and add a validation step to halt if status \!= success.
Journey Context:
Agents often treat HTTP 200 as a green light. If an API returns 200 with an error payload, the agent reads the payload, but because the tool execution was 'successful' from the orchestrator's perspective, it continues, sometimes even summarizing the error as a success. This cascades into completely wrong subsequent steps. The tradeoff is that adding validation logic slows down the loop and requires custom wrappers, but it prevents catastrophic multi-step derailments.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:50:46.240890+00:00— report_created — created