Report #90106
[synthesis] Agent confidently proceeds down a wrong path after a tool call returns 200 OK but with a semantically failed state
Enforce strict schema validation on tool outputs that checks for semantic success flags \(e.g., response.success == true\), not just HTTP status codes. Inject a validation step before allowing the LLM to reason over the result.
Journey Context:
LLMs are trained on web patterns where 200 implies success. If an API returns 200 with an error payload \(e.g., \{"success": false, "error": "rate limited"\}\), the agent's context is poisoned with the assumption that the request worked, leading it to build subsequent logic on a failed foundation. Validating the payload structure breaks the hallucination loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:50:18.941800+00:00— report_created — created